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@ A leader sequence to promote the secretion of gene products. 

@ A recombinant DNA transfer vector contains a leader 
sequence polynucleotide which codes for a signal polypep- 
tide of formula I, 

Met - Arg - Pro - Ser - He - His - Arg - Thr - 

Ala - lie - Ala - Ala - Val - Leu - Ala - Thr - I 

Ala - Phe - Val - Ala - Gly - Thr 

Preferably the transfer vector is a ptasmld. In one pre- 
ferred embodiment the leader sequence polynucleotide is 
downstream of and in reading phase with 8 bacterial or yeast 
promotor and a ribosome binding she, and upstream of and in 
reading phase with 8 structural gene. The structural gene may 
be, for example, the carboxypeptidase Gt(CPGt) gene from 
the chromosomal DNA of Pseudomonas species strain RS - 
16. Examples of plasmlds containing the leader sequence 
polynucleotide and the CPG> gene are pNM1, pNMHI, 
P NM14. pNM21, pNM22, pNM31, pNM32 and pLEC3. 
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LEADER SEQUENCE TO PROMOTE THE SECRETION OF GENE PRODUCTS 

The present invention relates to fragments of specific 
deoryribonucleotide sequences that promote the secretion of gene 
products from cells and in particular to recombinant MA transfer 
vectors that contain these fragments. 

Recent developments in biochemistry have led to the construct- 
ion of recombinant UNA transfer vectors in which, transfer vectors, 
for example plasmids, are made to contain exogeneous DM. In some 
cases the recombinant incorporates heterologous UNA that codes for 
polypeptides that are ordinarily not produced by the organism 
susceptible to transformation by the recombinant vehicle • 

In its basic outline a method of endowing a micro organism 
with the ability to synthesise a new protein involves three, 
general steps: / \ 

(a) isolation and purification of the specific gene or nucleo- 
tide sequences containing the genetically coded information 
for the amino acid sequence of the desired protein or 
polypeptide, 

(b) recombination of the isolated gene or nucleotide sequence 
with an appropriate transfer vector, typically DNA of a 
bacteriophage or plasmid to form a recombinant transfer 
vector that podes, in part, for the production of the 
desired protein or polypeptide, 
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(c) transfer of the vector to the appropriate micro organism 
and selection of a strain of the recipient micro o rganism 
containing the desired genetic information. 
Provided the gene or nucleotide sequence expresses its 
5 protein or polypeptide in the chosen micro organism, growth of the 
micro organism should then produce the desired protein or poly- 
peptide in significant quantities ♦ 

Once the micro organism has been cultured, the protein or 
polypeptide must he isolated from the undesired materials. This 

10 step is considerably facilitated if the majority of the desired 

protein or polypeptide iB present in the culture medium and/or the 
periplastic Bpace of the micro organism* In other words purifica- 
tion may he performed in a more efficient manner if, once 
expressed, the protein or polypeptide passes through the cell 

15 membrane and out of the cytoplasm. 

The passage of the protein or polypeptide through the cell 
membrane is desirable for two main reasons. First the desired 
protein or polypeptide will generally be foreign to the micro 
organism in which it is expressed. In many cases, therefore, it 

20 will be quickly broken down by proteolytic enzymes etc in the cells 
cytoplasm and will, subsequently, have a short half life within the 
cell. By transferring the protein or polypeptide out of the cyto- 
plasm soon after expression the stability of the protein or poly- 
peptide will be greatly increased. Second the number of unwanted 

25 genetic materials and products (from which the desired protein or 
polypeptide must be isolated) will be far greater in the cel^s 
cytoplasm than in the culture medium and/or in the cell f s peri- 
plastic space. It can be seen that on both of the above counts 
the transfer of the protein or polypeptide through the cell membrane 

30 and out of the cytoplasm will greatly facilitate protein or poly- 
peptide isolation. 

One way in which the secretion of gene products from the 
cell 1 s cytoplasm may be promoted is to produce, within the cyto- 
plasm, a preprotein or prepolypeptide in which the desired protein 

35 or polypeptide is preceded by a signal polypeptide. The pre- 
dominantly hydrophobic signal polypeptide directs the desired 
protein or polypeptide to the cell l s periplasmic space, where the 
signal peptide is removed as the desired protein or polypeptide 
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traverses the cell Membrane ♦ 

Many of the known signal peptides contain cysteine residues. 
These residues have been found to react in the cell membrane and 
thereby inhibit the efficient transfer of the desired gene product 
5 out of the cell. 

It is the primary object of the present invention to provide 
recombinant ENA transfer vectors containing a leader sequence poly- 
nucleotide that codes for a signal peptide that is cysteine free. 
Other objects and advantages of the present invention will become 
10 apparent from the following description thereof. 

According to the present invention there is provided a re- 
combinant 3JNA transfer vector comprising a leader sequence poly- 
nucleotide coding for Bignal polypeptide of formula I ; . 

Ket-Arg-Pro-Ser-lle-His~Arg-!Thr- Ala-lie- Ala- I 
1 5 Ala-7al-Leu-na-Thr-Ala-Phe-Val-Ala-Gly-Thr 

The transfer vector may be a bacteriophage or, which is 
preferred, a plasmid. 

Preferably the majority of the codons in the nucleotide 
sequence are those preferred for the expression of microbial 
20 genomes. Suitable codons are listed in UK 1,568,047 and UK 

2007675A, and these publications are therefore incorporated herein 
by reference. 

In one preferred embodiment of the present transfer vector the 
nucleotide sequence has formula II 
25 5 f - ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 
GCC TTO GTG GCG GGC ACC - V 

The nucleotide sequence coding for the signal polypeptide (the 
leader sequence poly nucleotide) will preferably be downstream of and 
50 in reading phase with a bacterial or a yeast promoter and a prokary- 
otic ribosome binding site in the transfer vector. Moreover the 
leader sequence polynucleotide will either be upstream of an insert- 
ion site for a structural gene or, which is preferred, will be up- 
stream of and in reading phase with a structural gene coding for a 
desired protein or polypeptide. Preferably the gene codes for a 
eukaryotic, particularly a mammalian, proteiin pr polypeptide. 

The structural gene may code, for example, for such eukaryotic 
proteins as human growth hormone, human insulin or human chorionic 
somatomammotropin. Alternatively it may code for such prokaryotic 
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proteins as E.coli ^B-galactoBidase or Pseud am onas car boxy peptidase 

(CPG^) (Carboiypeptidase is on enzyme, produced by Pseudomonas 

species strain ES-16, that has application in cancer chemotherapy * 
2+ 

It is a Zn containing dimer of 2 x 42,000 daltons and has high 
affinities (Em values of 1<T^ or lO^ 6 **) for both 5-aethyltetra- 
hydrofolate, the predominant ciculatory form of folate in mammal b and 
for the folic acid antagonist methotrexate (MX), which is widely used 
in cancer chemotherapy. The enzyme may be used directly for the 
plasma depletion of reduced folates, essential as co- factors in purine 
and particularly in pyrimidine biosynthesis. CPG^ has been shown to 
inhibit the development of the Walker 256 carcinoma in vivo and to 
remove MIX from circulation in patients where prolonged exposure to 
high doses of MHC leads to toxicity). 

Examples of transfer vectors according to the present invention 
that code for CPGg are pNM1, pUMIH, pHK14, pHM21, pNM22, pNM31,pNM32 
and pLEC3. 

The promoter is preferably a high expression bacterial or yeast 
promoter for the structural gene in a variety of hosts. The particular 
choice of promoter will depend on the microorganism to be transformed. 
For example the transformation of E.coli vill generally be effected by 
a transfer vector in which an E.coli promoter controls the expression 
of the structural gene. Examples of E.coli promoters are those present 
in the plasmids pBH 322 and pAT 153. By contrast, the transformation 
of Pseudomonas species will generally be effected by a transfer 
vector in which a Pseudomonas promoter controls the expression of the 
structural gene. Examples of Pseudomonas promoters are those present 
in the plasmid pET 230 or Pseudomonas chromosom a l Ma. 

In order to express the structural gene the present transfer 
vector will be transformed into a suitable microorganism. According 
to a further aspect of the present invention therefore there is 
provided a microorganism transformed by a recombinant UNA transfer 
vector according to this invention. The microorganism will preferably 
be a bacterium or yeast in which high expression of the structural 
gene, within the transfer vector, occurs. Depending on the choice of 
promoter the microorganism may be a strain chosen from one of the 
following bacteria E.coli, Pseudomonas and Bacillus or the yeast 
Saccharomyces cerevisiae. 

Having transformed the microorganism, the protein or polypeptide, 
for which the structural gene codes, may then be expressed by culturing 
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the transformed microorganism in a culture medium. It is the primary 
advantage of the present invention that culturing the transformed 
microorganism affords a preprotein or prepolypeptide in which the 
desired protein or polypeptide is preceded by the present signal 
polypeptide. This means that soon after expression the signal 
polypeptide directs the desired protein or polypeptide to the cell's 
periplasmic space, where the signal polypeptide is removed as the 
desired protein or polypeptide traverses the cell membrance. Since 
the present signal polypeptide is free of cysteine residues the 
desired gene product will be efficiently secreted through the membrane. 

The present transfer vectors may be prepared by any of the methods 
that are well known in the recombinant UNA art* For example the 
leader sequence poly nucleotide may be synthesised by the modified 
triester method of K.Itakura etal, JACS, 1975, 97» 7327 or by the 
improved oligodeoxynucleotide preparation described in UK 2007675A. 
The disclosure of both of these references is incorporated herein by 
reference. The synthesised polynucleotide may then be inserted in a 
transfer vector, preferably a plasmid. In the transfer vector it will 
preferably be downstream of and in reading phase with a bacterial or a yeast 
promoter and a prokaryotic ribosome binding site. The leader sequence 
polynucleotide should also be either upstream of a structural gene 
insertion site or upstream of and in reading phase with a structural 
gene. 

Alternatively, UNA fragments containing the leader sequence 
polynucleotide may be obtained from natural sources, in particular from 
the chromosomal UNA of Pseudomonas species strain ES-16. In this 
particular case a polynucleotide (formula II above) coding for the 
present signal polypeptide immediately precedes a structural gene 
coding for CPG^. A number of the UNA fragments containing this leader 
sequence polynucleotide may therefore be recognised by their ability, 
on insertion into a plasmid and transformation of a microorganism by 
the resultant recombinant vector, to enable a microorganism to grow on 
folate. Examples of such recombinant transfer vectors that contain both 
a polynucleotide coding for the present signal polypeptide (formula II 
above) and a structual gene coding for CPG^ are pNM1 , pNM111, piNM14f 
pNM21 f pNM22, pNMJI, pNM32 ani pLEC3. Of coarse, dice a Fol + recombinant 
vector has been obtained in this way it may be subcloned to afford 
alternative vectors (either Pol" 1 " or Fol") that also contain a 
polynucleotide coding for the present signal polypeptide. 
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Once a suitable UNA fragment has been isolated it may then be 
inserted in a transfer vector, preferably a plasmid. In the transfer 
vector the leader sequence polynucleotide on the inserted fragment 
should be downstream of and in reading phase with a bacterial or a yeast 
promoter and a prokaryotic ribosome binding site. The leader sequence 
polynucleotide should "also be either upstream of a structural gene 
insertion site or upstream of and in reading phase with a structural gene. 

The structural gene for insertion downstream of and in reading 
phase with the present leader sequence polynucleotide may be obtained, 
for example, by the synthetic methods mentioned above (thiB is 
particularly useful for the preparation of genes coding for small 
proteins, such as h uman growth hormone, insulin and human chorionic 
Bomatomammotropin,) Alternatively the structural gene may be prepared 
from m=ENA by the use of the enzyme reverse transcriptase or may be 
isolated from natural Bources (chromosomal UNA) • 

An example of the latter method is the isolation of USA fragments 
containing a polynucleotide sequence (shown in Table 1) coding for the 
enzyme CPG^ ( amino acid sequence also shown in Table 1) from 
Pseudomonas species strain RS-16 chromosomal UNA. Eramples of plasmids 
containing a CPG^ structural gene, as well as a polynucleotide coding 
for the present signal polypeptide (formula II above), are pNMl, pMM111, 
pHK14» pNM21, pNK22, pNM31 , pNK32 and pLEC3. 

Once prepared or isolated the Leader sequence polynucleotide and 
the structural gene will be inserted into a transfer vector, preferably 
a plasmid, to form a recombinant MA transfer vector according to the 
present invention. The insertion step or steps will preferably be 
effected by one of the well known techniques in this art tnat employ 
restriction endonucleases, see for example the methods discussed in 
UK 2090600A, the disclosure of which is incorporated herein by 
reference. The choice of transfer vector will be determined by the 
microorganism in which the leader sequence polynucleotide and 
structural gene are to be expressed. Generally the transfer vector 
will be a cloning vehicle that is suitable for transforming the chosen 
micro-organisms and that displays a phenotypical characteristic, such 
as antibiotic resistance, by which the recombinant transfer vectors may 
be selected. Thus, if the micro-organism is to be E-coli, then 
suitable transfer vectors will be the E-coli plasmids pBR322 and 
pAT153. Alternatively, if the micro-organism is to be Pseudomonas, 
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TABLE 1 

A Polynucleotide Sequence, coding for CPG^ isolated from 

PEeudoaonaa species strain RS - 16 chromosomal MA 

1 

Met Arg Pro Ser lie Kis Arg Thr 

y - ATG CGC CCA TCC ATC CAC CGC ACA 

10 

Ala lie Ala Ala Val Leu Ala Thr Ala 

GCC ATC GCC GCC GTG CTG GCC ACC GCC 

20 

*Phe Val Ala Gly Thr Ala Leu Ala' Gin 

TTC GTG GCG GGC ACC GCC CTG GCC CAG 

Lys Arg Asp Asn Yal Leu Phe Gin Ala 

AAG CGC GAG AAC GTG CTG TTC CAG GCA 

40 

Ala Thr Asp Glu Gin Pro Ala: Val lie 

GCT ACC GAC GAG CAG CCG GCC GTG ATC 

Lys Thr Leu Glu Lys Leu Val Asn He 

AAG ACG GTG GAG AAG CTG GTC AAC ATC 

60 

Glu Thr Gly Thr Gly Asp Ala Glu Gly 

GAG ACC GGC ACC GGT GAC GCC GAG GGC 

70 

He Ala Ala Ala Gly Asn - Phe Leu Glu 

ATC GCC GCT GCG GGC AAC TTC CTC GAG 

80 

Ala Glu Leu Lys Asn Leu Gly Phe Thr 

GCC GAG CTC AAG AAC CTC GGC TTC ACG 



Val Thr Arg Ser Lys Ser Ala Gly Leu 

GTC ACG CGA AGO AAG TCG GCC GGC CTG 

90 

Val Val Gly Asp Asn He Val Gly Lys 

GTG GTG GGC GAC AAC ATC GTG GGC AAG 



100 

lie Lys Gly 

ATC AAG GGG 

110 

Leu Leu Met 

CTG CTG ATG- 

Tyx Leu Lys 

TAC CTC AAG 

Pro Phe Arg 

CCG TTC CGC 

Tyr Gly Pro 

TAC GGC CCG 

Gly Gly Asn 

GGC GGC AAC 

Leu Lys Leu 

CTC AAG CTG 

Arg Asp Tyr 

CGC GAC TAC 

Phe Asn Thr 

TTC AAC ACC 

180 

Phe Gly Ser 

TTC GGC TCG 

i90 

Glu Ala Lys 

GAA GCC AAG 

200 

Ser Phe Glu 

TCC TTC GAG 



8 

Arg Gly Gly 

CGC GGC GGC 

Ser His Met 

TCG CAC ATG 
120 

Gly lie Leu 

GGC ATT CTC 

130 

Val Glu Gly 

GTC GAA GGC 

140 

Gly lie Ala 

GGC ATC GCC 

Ala Yal lie 

GCG GTC ATC 

Leu Lys Glu 

CTG AAG GAA 

Gly Thr lie 

GGC ACC ATC 

Asp Glu Glu 

GAC GAG GAA 

Arg * Asp Leu 

CGC GAC CTG 

Leu Ala Asp 

CTG GCC GAC 

Pro Thr Ser 

CCC ACC AGO 
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Lys Asn Leu 

AAG AAC CTG 

Asp Thr Val 

GAC ACC GTC 

Ala Lys Ala 

GCG AAG GCC 

Asp Lys Ala 

GAC AAG GCC 

Asp Asp Ly3 

GAC GAC AAG 

150 

Leu His Thr 

CTG CAC ACG 

160 

Tyr Gly Val 

TAC GGC GTG 

170 

Thr Val Leu 

ACC GTG CTG 

Lys Gly Ser 

AAG GGT TCC 

lie Gin Glu 

ATC CAG GAA. 

Tyr Val Leu 

TAC GTG CTC 

Ala Gly Asp 

GCA GGC GAC 
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Glu 
GAA 



Lys 
AAA 



Leu 
CTG 



210 
Ser 

TCG 



Leu 
CTG 



Gly 
GGC 



Thr 
ACC 



Ser Gly 
TCG GGC 



lie 
ATC 



Ala 
GCC 



Tyr 
TAC 



Val 
GTG 



220 
Gin 

CAG 



Val 
GTC 



Asn 
AAC 



lie 
ATC 



Thr 
ACC 



Gly 
GGC 



Lys 
AAG 



Ala 
GCC 



Ser 
TCG 



His 
CAT 



230 
Ala 

GCC 



Gly 
GGC 



Ala 
GCC 



Ala 
GCG 



Pro 
CCC 



Glu 
GAG 



Leu 
CTG 



Gly 
GGC 



Val 
GTG 



Asn 
AAC 



240 

Ala 
GCG 



Leu 
CTG 



Val 
GTC 



Glu 
GAG 



Ala 
GCT 



Ser 
TCC 



Asp 
GAC 



Leu 
CTC 



Val 
GTG 



Leu 
CTG 



250 
Arg 

CGC 



Thr 
ACG 



Met 
ATG 



Asn 
AAC 



lie 
ATC 



Asp 
GAC 



Asp 
GAC 



Lys 
AAG 



Ala 
GCG 



Lys 
AAG 



260 
Asn 

AAC 



Leu 
CTG 



Arg 

CGC 



Phe 
TTC 



Asn 
AAC 



Trp 
TGG 



Thr 
ACC 



lie 
ATC 



Ala 

GCC 



Lys 
AAG 



270 
Ala 

GCC 



Gly 
GGC 



Asn 
AAC 



Val 
GTC 



Ser 
TCG 



Asn 
AAC 



lie 
ATC 



lie 
ATC 



Pro 
CCC 



Ala 
GCC 



280 
Ser 

AGC 



Ala 
GCC 



Thr 
ACG 



Leu 
CTG 



Asn 
AAC 



Ala 
GCC 



Asp 
GAC 



Val 
GTG 



Arg 
CGC 



Tyr 
TAC 



290 
Ala 

GCG 



Arg 
CGC 



Asn 
AAC 



Glu 
GAG 



Asp 
GAC 



Phe 
TTC 



Asp 
GAC 



Ala 
GCC 



Ala 
GCC 



Met 
ATG 



300 
Lys 

AAG 



Thr 
ACG 



Leu 
CTG 



Glu 
GAA 



Glu 
GAG 



Arg 
CGC 



Ala 
GCG 



Gin 
CAG 



Glu 
CAG 



Lys 
AAG 



510 
I^ys 

AAG 



Leu 
CTG 



Pro 
CCC 



Glu 
GAG 



Ala 
GCC 



Asp Val Lys 

GAC GTG AAG 

Arg Pro Ala 

CGC CCG GCC 

Gly Lys Lys 

GGC AAG AAG 

Ala Tyx Tyr 

GCC TAC TAC 

Leu Gly Val 

CTG GGC GTG 

560 

Gly Thr Asp 

GGC ACC GAC 

570 

Ser Gly Lys 

TCA GGC AAG 

380 

Gly Leu Pro 

GGC CTG CCG 

Asp Lys Ala 

GAC AAG * GCC 

« 

Ala lie Pro 
GCG ACT CCG 

Ala Arg Leu 
CGC CGC CTG 



10 

320 

Val lie Val 

GTG ATC GTC 

Phe Asn Ala 

TTC AAT GCC 

Leu Val Asp 

CTG GTC GAC 

Lys Glu Ala 

AAG GAA GCC 

Glu Glu Arg 

GAA GAG CGC 

Ala Ala Tyr 

GCG GCC TAC 

Pro Val lie 

CCA GTG ATC 

Gly Phe Gly 

GGC TTC GGC 

390 

Glu Tyr Val 

GAG TAC GTG 

400 

Arg Arg Leu 

CGC CGC CTG 

410 

lie Met Asp 

ATC ATG GAT 
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Thr Arg Gly 

ACG CGC GGC 

330 

Gly Glu Gly 

GGC GAA GGC 

340 

Lys Ala Val 

AAG GCG GTG 

350 

Gly Gly Thr 

GGC GGC ACG 

Thr Gly Gly 

ACC GGC GGC 

Ala Ala Leu 

GCC GCG CTC 

Glu Ser Leu 

GAG AGO CTG 

Tyr His Ser 

TAC CAC AGC 

Asp lie Ser 

GAC ATC AGC 

Tyr Met Ala 

TAC ATG GCT 

Leu Gly Ala 

CTG GGC GCC 



Gly Lys 

GGC AAG TGA - 3' 
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Amino acids 1 to 22 are the present signal polypeptide 
Amino acids 25 to 415 are the CPG 2 structural gene 

MB The leader sequence polynucleotide is the preferred polynucleo- 
tide of formula II. 
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then a suitable transfer vector will be Pseudomonas pkT230. 

The present recombinant UNA transfer vectors, micro-organisms 
transformed by the present recombinant DHA transfer vectors and 
processes for the preparation of said vectors and micro-organisms will 
5 now be described by way of example only, with particular reference to 
the Figures in which: 

Figure 1 is a restriction enzyme cleavage Bite map of pNM1, 

Figure 2 is a restriction enzyme cleavage site map of pHM111, 

Figure 3 is a restriction enzyme cleavage Bite map of pHM14, 

10 Figure 4 is a restriction enzyme cleavage Bite map of pKM21, 

Figure 5 is a restriction enzyme cleavage site map of pNM22, and 

Figure 6 illustrates the process for the preparation of a recombinant 

plasmid containing both the present leader sequence polynucleotide nnd the 
p-Galactosidase structural gene, and 

Figure 7 is a restriction enzyme cleavage site map of pI£C3. 
15 Materials and Methods 

Bacterial strains and plaremldB 

The bacterial strains used were Escherichia coli W5445 ( pro 

leu thi thr sup E44 lac Y ton Arm Str ) Pseudomonas putida 

2440 (r~) and Pseudomonas sp strain RS-16. The plasmids employed were 

20 P BR322 (F Bolivar et al Gene, 1977t 2, 95) 1 pAT153 (A J Twigg et al, 

Nature, 1980, 283, 216) and pKT230 (M Bagdasarin et al , Gene 1981, 16, 
237) and pR0G5 (R.P.Sherwood et al , The Molecular Biology of Yeast, 1979 
Cold Spring Harbor Publications), 
Media and culture conditions 

E.coli was routinely cultured in L-broth (1% tryptone , 0.5% yeast 
25 extract, 0.5%NaCl). Solidified medium (ir-agar) consisted of L-broth 
with the addition of 2% (w/v) agar (Bacto-Difco). Antibiotic con- 
centrations used for the selection of transformants were 50 ug/ml 
ampicillin, 15 Jig/ml tetracycline and 30 ug/ml kanamycin. In the case 
of E.coli these were conducted in 2YT liquid medium (1.696 tryptone, 
30 1% yeast extract, 0.5% NaCl) containing 1% glucose, and 0.0596 folate 
where appropriate • The pseudomonads were grown in a irnn-tTnai salts 
solution consisting of per litre: MgSO^, 0.05g; CaCl^, 2^0, 0.05g; 
FeS0 4 .7H 2 0, 0.005g; MnSO^ 0.001 5g; Na^o^ 2H 2 0, 0.001 5g; KI^PO^ 5g; 
K 2 HP0 4 .3H 2 0, 12g; glutamate, 10g. The minimal medium employed for 
35 E.coli was M9 medium (J Miller, Experiments in molecular genetics, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1972). 
Purification of UNA 

Plasmids were purified from chloramphenicol amplified cultures 
(D B Clewell, J Bacteriol, 1972, 110, 667) by Brij-lysis (B B Clewell 
et al, Proc Natl Acad Sci, USA, 1969f 62, 1159) and subsequent caesium 
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chloride-ethidium bromide density gradient centrifugation (A Colman 
et al, Eur. J Biochem, 1978, 91 1 303)* A rapid, small scale plasmid 
isolation technique (Buraboim et al, Huc.Acids Res, 1979f 7, 1513) was 
also employed for screening purposes. Chromosomal DMA from the donor 
Pseudomonad strain (RS-16) was prepared essentially as described by 
J Mannar, J.Mol.Biol, 1961, 3, 208. 
Restriction, ligation and transformation methods 

Restriction endonucleases and DBA ligase were purchased from Bethesda 
Research Laboratories and used in the buffers and under the conditions 
recommended by the supplier. Transformation of E.coli was essentially 
as described by S N Cohen et al., Proc.Katl.Acad.Sci. , USA, 1972, 69, 
2110, while Ps.putida was transformed by the method of M Bagdasarian and 
K N Timmi8, Current Topics in Microbiology and Immunology, Eds 
P H Hof Schneider and W Goebel, Springer Verlag, Berlin, 1981, p 47. 
Agarose gel electrophoresis 

Digests were electrophoresed in 0.8% agarose slab gels (10 cm x 20 cm x 
0.5 cm) on a standard vertical system (Raven), employing Tris-borate- 
EDTA buffer. Electrophoresis of undigested DNA was at 125V, 50 mA for 
3 hours, while digested UNA was electrophoresed at 15V, 10 mA for 
16 hours. Fragment sizes were estimated by comparison with fragments 
of TlDNA digested with Hindl ll and XDHA cut with both HinflT TI and 
EcoRI. Fragments were isolated from gels using electroelution 
(M V McDonnell et al, Proc. Natl.Acad.Sci, USA, 1977, 74, 4835). 
Determination of carboxypeptidase G 2 activity 

Bacteria were grown in 1 litre batch culture and 100 ml samples taken 
at various stages in the growth phase. Samples were cooled on ice, 
centrifuged at 13,000 x g for 10 minutes and re suspended and frozen 
in 5 ml of 0.1 M Tris HC1, pH 7*3 containing 0.2 mM ZnSO^. The cells 
were disrupted using a MSE Ultrasonic Disintegrator (150 V) at medium 
frequency, amplitude 2, for three 30-second intervals on ice. Cell 
debris was removed by centrifugation at 10,000 x g for 5 minutes. 
CPG 2 activity was determined after J L McCullough et al, J.Biol. Chem, 
1971, 246, 7207. A.lal reaction cuvette containing 0.9 ml of 0.1 M 
Tris-HCl,pH 7-3 plus 0.2 mM ZnSO^ and 0.1 ml of 0.6 mM methotrexate was 
equilibrated at 37°C. Enzyme extract was added to the test cuvette 
and the decrease in absorbance at 320 nm measured using a Pye-Uhicam 
SP1800 double-beam spectrophotometer. Enzyme activity per ml extract 
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was calculated as A. 320 nm absorbance/mi n divided by 8.3, which 
is equivalent to the hydrolysis of 1 jimol of MTX/min at 37°C. 
Protein concentration was determined by the method of M M Bradford, 
AnalBiochem, 1976, 72, 248. 

Cell fractionation techniques 

Bacterial cultures were grown in the low phosphate medium of H C 
lieu and L A Heppel, (J Biol Chem, 1964, 240, 3685), supplemented 
with 100 ug/ml ampicillin, to an 0D^^ = 1.0. 40 ml of culture was 
centrifuged at 5000 g for 10 min, washed in 5 ml of 10 mM Tris-HCl 
pH 7.0, and re suspended in 0.9 ml 0.58 M sucrose, 0.2 mM DOT, 30 mM 
Tris-HCl pH 8,0. Conversion to spheroplasts was achieved by the 
addition of 20 pi of lysozyme (2 mg/ml), 40 jlL 0.1 K EDTA, and 
incubation at 23° for 10 min (HC Neu et al, J Biol Chem, 1964, 239, 
3893). The spheroplasts were placed on ice and 0.1 ml of 30% (v/v) 
BSA added, followed by 5 ml of sucrose-tris buffer. Sedimentation 
of the spheroplasts was achieved by centrifugation at 5000g for 10 
min and the supernatant retained as the ^periplasmic* fraction. 
The pellet was resuspended in 5 ml 10 mM Tris-HCl, 0.2 mK DTT pE 
7.0 and and sonicated at 20 Kc/sec, 2 Amps for 15 sec. Remaining 
whole cellB were removed by centrifugation at 1000 x g for 10 min. 
Centrifugation at 100000 x g for 1 hr, at 4°C, separated the soluble 
(cytoplasmic) proteins from the particulate (membrane-bound) 
proteins. The membrane pellet was resuspended in 1 ml of 10mM 
Tris-HCl, 0.2 mM DTT, pfl 7*0. 

CPGg was assayed as described. Alkaline phosphatase was assayed 
according to J Killer, Experiments in Molecular genetics, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, 1972, NADH 
oxidase according to M J Osborn et al, j Biol Chem, 1972, 247, 3962 
and glyceraldehyde - 3 - phosphate dehydrogenase after K Suzuki et 
al, ISMS, 1971, 13, 217. 
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Example 1 

Preparation of recombinant plasmid pNJCL (A plasmid containing both 
the present leader sequence polynucleotide and the CPG 2 structural 
gene) 

5 To isolate the gene for carboxypeptidase G 2 together with the 

leader sequence polynucleotide chromosomal ENA prepared from the 
PseudomonaB host (strain BS-16) was partially digested with Sau3A 
and fragments of between 6-8 Md isolated from agarose gels by 
electroelution. The •sized 1 DNA was ligated with alkaline phosph- 

10 atase treated BamKL cut piBR522, transformed into E.coli V5445, and 
Ap transformants selected. Of the 3,500 Ap r colonies obtained, 
approximately 70# were TC 8 . Utilisation of a rapid plasmid isolation 
technique on 50 Ap r TC B transformants demonstrated that 90% of the 
gene bank harboured plasmids of the expected size. As a further 

15 check on the authenticity of the gene bank, the individual clones 
were screened for the acquisition of a Leu + phenotype. Two such 
clones were identified. Both carried a plasmid capable of trans- 
forming leuB (B-isopropylmalate dehydrogenase) E.coli mutants to 
prototrophy. 

20 Acquisition of a functional CPG^ gene should enable E.coli 

to utilise folic acid as a carbon source. The 2,400 gene bank 
clones were screened for the ability to grow on minimal medium 
containing folate as the sole source of carbon (ie Fol + ). A 
single Pol + clone was detected and shown to harbour a plasmid 

25 capable of transforming plasmid-minus W5445 to the Fol + phenotype. 
Classical restriction mapping of this plasmid (pNMl) was under- 
taken which revealed the presence of a 5.9 Md insert of pseudom- 
onad UNA within pBR322. The restriction enzyme clear age site map 
of pNMl is given in Figure 1. The nucleotide sequence of the 

30 leader sequence polynucleotide and the CP& 2 structural gene is 
given in Table 1. 
Example 2 

Subclonlng of plasmid pNKL to form pKMlll 

In order to pinpoint the position of the CPG 2 gene and the 
35 leader sequence polynucleotide within the 5-9 Md insert, sub- 
cloning of various restriction enzyme fragments, into pBR322, was 
undertaken. A functional CPG_ gene was shown not to occur on 
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Xhol or Sphl fragments of tne pKMl insert, but was present on a 
3.1 Md Bglll fragment. This latter fragment was cloned into the 
BfrnTH site of pBR322 to give pNKLl (6.0 Md). A further reduction 
in the size of pNKll was achieved by digesting with Sail and 
5 religating the resultant fragment to yield pNMlll. In addition, 
plasmids in which the smaller 0.95 Mi Sail fragment had become 
inserted in the opposite orientation to the parent plasmid (pNMll) 
were Pol". Taken together these subcloning results indicate that 
the CPGg gene and the leader sequence polynucleotide lie between 
10 the Bglll site at 4.14 and the SA11 site at 6.03 on pNTCL. 

Furthermore, the gene contains a Sphl (5*1?)» Sail .(5-07) and 
at least one Xhol (4.56 and/or 5*56) site. The restriction 
enzyme cleavage site map of pNMlll is given in Figure 2. 
Example 3 

15 Preparation of recombinant plasmid pMK14« (A plasmid containing 
both the present leader sequence polynucleotide and the CPG-^ 
structural gene ) 

The 3.1 Mi Bgl II fragment from Example 2 above was partially 
digested with Sau3 A. These fragments were then cloned into the 

20 Bam HI site of pAT153 and transformed into E coli V5445- Of the 
two Ap r Tc B Fol + colonies obtained, one contained a plasmid which 
had acquired an extra Sal I and Bam HI site, this was pNM 14. 
The restriction enzyme cleavage site map of pNM 14 is given in 
Figure 3. Sequencing of the leader sequence polynucleotide and 

25 the CPG 2 structural gene present in pNM 14 gave the nucleotide 

structure shown in Table 1. DNA sequencing of pNM 14 also revealed 
that the Sal I - Bam HI fragment was a duplication of a segment of 
BNA from within the insert (marked * on Figure 3) composed of two 
contiguous Sau 3^ fragments. 

30 Example 4 and 5 

Preparation of recombinant plasmids pNM 21 and pNM 22 

( Plasmids containing both the present leader sequence polynucleotide 

and the CPG^ structural gene) 

The 3.1 Md Bgl II fragment from Example 2 was cloned into the 
35 Bam HI site of pAT 153 and transformed into E coli V 5445* Two 

Ap r Tc B Fol + colonies were obtained, one containing a plasmid pKM 21 
in which the fragment was inserted in the opposite orientation to 
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pNM1 and one containing a plasmid pNM22 in which the fragment 
was inserted in the same orientation as pNM1. The restriction 
enzyme cleavage site maps of pNM21 and pNM22 are given In 
Figures 4 and 5 respectively, 
5 The two plasmids, pNM21 and pNM22 both transformed E.coli 

to Pol , indicating that a pseudomonad promoter was present on 
the 3.1M fragment. However, cells carrying the plasmid pNM21, 
in which the BgUl fragment was cloned in the opposite orientation 
to pflMl, exhibited more rapid growth with folic acid as the sole 

10 carbon source. This difference was clearly visible on agar medium, 
where colonies developed concentric yellow ■halos 1 of precipi- 
tated pteroic acid, the insoluble product of folate hydrolysis. 

Confirmation that pNM21 gave enhanced expression of CPG^ 
over pNM22, was obtained by assaying enzyme production during 

15 batch growth of cells containing either plasmid. (The cells were 
grown in complex medium supplemented with *\% (w/v) glucose and 
where appropriate 0.09?6 (w/v) folic acid. The generation time 
was 56-66 min. The culture was sampled at hourly intervals and 
whole cells were disrupted by sonication. Enzyme activity was 

20 determined in the centrifugal extract). Eesults are given in 
Table 2. 

The expression of CPG^ from the plasmids pHK22 and pHM1 was 
2.5 units/litre of culture, representing 0.005% soluble protein. 
In contrast, expression from pNM21 was 2000-5500 units/litre of 
25 culture, which represented 4«7% soluble protein. As the cloned 
gene is inserted into the BamH T site of pAT153, the observed 
higher expression of pNM21 is almost certainly due to transcript- 
ional read through from the Tc promoter. The low expression of 
CPG 2 carried on plasmids pNM1 and pNM22 is consistent with the 

30 view that Pseudomonas promoters function poorly in E.coli . It 
is also apparent from Table 2 that in the presence of folate 
there is a two-fold increase in the specific activity of enzyme 
measured in cell sonicates. This phenomenon has been observed 
in all experiments, but does not seem to be associated with 

35 classical induction of the CPG 2 gene, as overall enzyme yield in 
the presence or absence of folate remains at about 3000 u/litre 
culture. It in fact reflects a consistent depression in the 
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soluble protein levels measured in sonicates from cells grown in 
the presence of folate. There is no obvious difference in growth 
rate of cells grown with folate and the reasons for this result 
axe not clear. 

TABLE 2: CAEBOXYPEPTIDASE G PRODUCTION BY E COLI V5445 

, CONTAINING THE PLASMTDS pNMl, pNM21 and pNM22. 



CULTUBE CABBOXXPEPTIDASE G ? SPECIFIC ACTIVITY (u/WS SOLUBLE 

PROTEIN) 



AOS 




pNM1 


pHM22 




pKM21 




(HE) 


-POL 


+P0L 


-POL +F0L 


-POL 


+P0L 


1 










11.5 


13.4 


2 










12.9 


9.6 


3 


.008 


.005 


.010 


.019 


13.9 


23.3 


4 


.009 


.011 


.015 


.013 


12.3 


26.9 


5 


.007 


.019 


.016 


.016 


11.5 


25.6 


6 


.005 


.024 


.014 


.023 


13-7 


24.1 


7 


.015 


.029 


.024 


.043 


. 13.2 


20.6 


8 


.013 


.028 


.024 


.046 


13.0 


23.6 



Expression of the cloned gene in PB.putida 

The observation that the CPG^ gene was expressed in E.coli 
regardless of the orientation of the gene within the vector, 
suggested that the promoter region of the CPG 2 gene had been cloned 
with the structural gene and the leader sequence polynucleotide. 
The low expression of CPG 2 within E.coli from its natural promoter 
(pNMl, pNK22, pNM11l) confirmed other findings that Pseudomonas 
promoters are poorly recognised by E.coli BNA polymerases. It would 
be expected that if the gene was introduced back into a pseudomonad 
cellular environment , then improved expression from the Pseudomonas 
promoter should result. The 3.1 Md BgUI fragment was subcloned 
into the Pseudomonas cloning vector pKT230 at its single BamEE site. 
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Two plasmidB were obtained, pNM31 and pNM32 representing the two 
pOBBible orientations of the cloned gene. These plaamids were 
transformed into Ps.putida 2440 by the method of Bagdasarian and 
Timmis. Pseudomonad cells carrying both plasmidB were cultured in 
TMn-tmpI salts medium and enzyme production monitored. 

Yields of 500-1000 units/litre of culture were obtained regard- 
less of gene orientation within the plasmid. Specific activity of 
the enzyme in cell sonicates was 1.5 to 4.0 U/mg protein represent- 
ing 0.3 to 0.7^ soluble protein (compared with < 0.05% Boluble 
protein in the donor strain RS-16). This result strongly indicates 
that the CPG^ promoter is present and operating in a pseudomonad 
background. When the same plasmids were transformed into E.coli 
V5445 12-40 Units/litre were found at specific activity <0. 07 U/mg 
(<0.01% soluble protein). 
Periplasmic localisation of CPG^ 

There is evidence that CPGg is located in or near the peri- 
plasmic space of Pseudomonas strain ES-16. Pteroic acid, the 

product of CPG hydrolysis of folic acid is extremely insoluble 
2 

and is found predominantly outside the cell in both liquid and solid 
media. Exogeneous pteroic acid is also seen in E.coli cultures 
containing the cloned gene when folic acid iB present in the medium. 
This is clearly demonstrated "by the , halo , of precipitated pteroic 
acid observed around colonies carrying plasmids in which expression 
of CPG 2 is from the Tc promoter of pBR322 (eg pNM2l). 

The localisation of CPG^ produced by E.coli cells carrying 
pNM21 was examined by the separation of cellular proteins into cyto- 
plasmic, periplasmic, and whole membrane fractions. As a control, 
levels of three marker enzymes, alkaline phosphatase (periplasmic), 
glyceraldehyde-3-phosphate dehydrogenase (cytoplasmic) and HADH.O^ 
oxidoreductase (membrane-bound), were also determined. As can be 
seen from Table 3 97# of the CPG,, activity occurs in the periplasm, 
equivalent to the marker peripla smi c enzyme, alkaline phosphatase. 
ThiB confirms the presence in pNM21 of a leader sequence polynucleo- 
tide next to the CPG,, gene that codes for a signal polypeptide 
according to this in vention that promotes the secretion of CPG^ 
from the cytoplasm into the periplasmic space. 
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Carboxypeptidase synthesised In E.coli 

The specific activity of CPG^ in crude cell extracts of cells 
carrying pNM21 was 50-fold higher than equivalent extracts from 
Pseudomonas strain BS-16. To determine whether the cloned gene= 
5 product in E.coli had the same properties as CPG^ from the 

pseudomonad, enzyme was purified from E.coli carrying pKK21. The 
specific activity of purified GPG^ (single hand SDS-PAGE) was 535 
U/mg of protein, which compares to 550 tf/mg of protein from the 
pseudomonad. GPG^ purified from E.coli clone pNM21 co-chromato- 

10 graphed with CPG^ from Pseudomonas strain BS-16 at a sub-unit 

molecular weight value of 42,000 daltons. Em vallues using metho- 
trexate as substrate were 7*4 x 10~Sl and E.0 x 10~^M respectively. 
In addition, antiserum raised against the Pseudomonas enzyme 
indicated immunological identity between the E.coli and Pseudomonas 

15 CPG^, as a confluent precipitation line was formed on Ouchterlony 
double diffusion analysis. 

Localisation of Carboxypeptidase 



ENZYME ACTIVITY 

FRACTION 0PG o IP GAP33H NADHQX. 



Periplasmic 97-0 97.1 6.8 0.25 

Cytoplasmic 2.6 2.3 93 8.4 

Kembrane-bound 0.4 0.6 0.2 89. 1 



AP = Alkaline phosphatase 

GAPJH = Glyceraldehyde-3--pkosphate dehydrogenase 

NADE0X = HADH.0 o oxidoreductase 
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Example 6 

Preparation of a recombinant plasmid containing both the present 
leader sequence polynucleotide and the B-Galactosidase structural 
gene 

Plaamid pNM14 (Example 3) vas treated with Sau 3A (GATC) and 
the fragments were cloned into the Bam HI site of M13 mp7 template 
DNA (single stranded ENA (Step A of Figure 6). The product carrying 
a 318bp Sau 3^ fragment coding for the present signal polypeptide 
and the first 22 amino acids of CPG 2 (nucleotide sequence of this 
fragment shown in Table 4) was selected and made double stranded. 
The DNA coding for the signal polypeptide (and the first part of 
CPG 2 ) was then excised as an Eco EI fragment. This Eco EI fragment 
was then cloned into the promoter cloning vector E.coli pMC1403 
(M.J. Casadaban et al, J Bacterid, 1980, 143» 971)> which carries 
only the structural gene (lac Z) for B-galactosidase (ie no 
promoter and no ATG start codon) (Steps B and C of Figure 6). 
Plasmids were obtained in which the Eco EI fragment had inserted in 
both orientations, however, only those in which fusion of the CP& 2 
sequence to the B-galactosidase sequence had occurred (i) yielded 
a 0.34 Kb fragment upon digestion with BamHI; (ii) enabled the host 
cell to hydrolyse the colourless lactose analogue, BCIG, and impart 
a blue colouration to colonies. The 0.34 Kb BamH I fragment has been 
recloned into M13mp7 and sequenced to confirm that fusion has 
occurred. The •precursor* fusion produced will consist of the 
signal peptide, the first 22 amino acids of CPG^, 6 amino acids 
derived from the M13mp7 and pMC1403linker units, and B-galactosidase 
from its 8th amino acid onward. 

Localisation experiments have been performed on cells carrying 
a plasmid coding for the •fusion gene 1 where the cellular proteins 
have been fractionated into periplastic, cytoplasmic and membrane 
fractions. In these experiments an organism (E,coli MC 1061) which 
is deleted for the lac Z gene was grown in phosphate medium (E.C. 
Neu et al, J Biol Chem, 1964> 240, 3685) and periplastic enzymes were 
released from the harvested cells by conversion to spheroplasts. 
Separation of Boluble proteinB (cytoplasmic) from particulate 
proteins (membrane band) was achieved by sonicating the harvested 
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Bpheroplasts and subsequent centrifugation at 100,000g for 1hr, 
to Bediment the cell membrane (T, J.Silhary et al, Proc Natl Acad 
Sci USA, 1976, 73, 3425). 

The results given in Table 5 demonstrate the presence of 50% 
of the B-galactosidase activity in the periplasmic space. This 
result is in direct contrast to similar work involving fusion of 
other periplasmic protein signal sequences to B-galactosidase , 
where the fusion proteins are not exported, but become jammed in 
the membrane (P.J. Bassfordet al, J Bacteriol, 1979> 139, 19 and 
S D finr et al, J Cell, Biol, 1980, 86, 701). 

TABLE 4 

The Polynucleotide Sequence of the 318 bp Sau 3A Fragment from 
Becombinant Plasmid pNH14 

5« - G ATC CAC GCA CTG AAG GCG CGC GGC 

AAG ACG CGC GGC GTG GCG ACG CTG TGC 

ATC GGC GGG GGC GAA GGC ACC GCA GTG 

GCA CTC GAT TGC TAT AAG AAC CAT GGC 

TGG GGA CGC CCG ACA ACA GGC GTC CAC 

CAG CTT TTT TCA TTC CGA CAA CCC GAA 

CGA ACA ATG CGT AGA GCA GGA GAT TCC 



Met 
ATG 
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Table 4 (coatd) 



Arg 
CGC 



Pro 
CCA 



Ser 
TCC 



lie 
ATC 



Hie 
CAC 



Arg 
CGC 
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Thr 
ACA 



Ala 
GCC 



lie 
ATC 



Ala 

GCC 



Ala 

GCC 



Val 
GTG 



Leu 
CTG 



Ala 

GCC 



Thr 
ACC 



Ala 

GCC 



Phe 
TTC 



Val 

GTG 



Ala 
GCG 



Gly 
GGC 



Thr 
ACC 



Ala 
GGC 



Leu 
CTG 



Ala 
GCC 



Gin 
CAG 



AAG 



Arg 
CGC 



Asp 

GAC 



Asn 
AAC 



Val 

GTG 



Leu 
CTG 



Phe 
TTC 



Gin 
CAG 



Ala 
GCA 



10 



15 



Ala 
GCT 



Thr 
ACC 



Asp 
GAC 



Glu 
GAG 



Gin 
CAG 



Pro 
CCG 



Ala 
GCC 



Val 
GTG 



lie 
ATC 



KB. This fragment carries the leader sequence coding for the 
signal polypeptide, a part of the GPG 2 structural gene coding 
for the first 22 amino acids of the protein, the ATG start 
codon, the CPG 2 ribosome binding site (AGGA.) and other components 
of the CPG 2 promoter region. 



TABLE 5 

Localisation of Signal Peptide - B-galactosidase Fusion 
Protein 







% LOCALISATION 






CPG 










/B-GAL 


AP 


GAPDH 


NADHQX 


Periplanmic 


50.3 


97.3 


3.4 . 


0.4 


Cytoplasmic 


30.9 


2.5 


95-3 


8.2 


Membrane-bound 


18.8 


0.2 


1.3 


89.4 



CPG^B-GAL 
AP 

GAPDH 



average results from 4 experiments 

Carboxypeptidase Gg-B-galactosidase fusion protein 
Alkaline phosphatase 

Glyceraldehyde-3-phosphate dehydrogenase 
KADH.Og Oxidoreductase 
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Kxample 7 

Preparation of a recombinant plasmid, containing both the present 
leader sequence polynucleotide and the GPG^ structural gene, able 
to replicate in E.coli and S. cerevisiae 

A 2.03 kilobase 3a:nTT I fragment coding for the present signal 
polypeptide and the entire CPG 2 molecule was cloned in both orient- 
ations into the B gmTrr site of an E. coli/S. cerevisiae shuttle vector 
pR0G5 (R.F. Sherwood and R.E. Gibson, The Molecular Biology of Yeast, 
1979» Cold Spring Earbor Publications) to give plasmids pLEC3 and 
pLEC4 (Figure 7). These plasmids were transformed into S. cerevisiae 
strain LL20 by the lithium acetate induced transformation method 
described by Ito et al., J. Bact., 1963, J53, 163. Yields equivalent 
to 10-20 units/litre of culture volume were obtained regardless of 
gene orientation within the plasmid. Specific activity of the enzyme 
in total cell extracts was 0 # 2-0.3u/mg protein representing 0.005^ 
soluble protein. This level of expression from the pseudomonad 
promotor in a yeast background is similar to the level found when the 
gene was reading from its own promotor in E.coli ( 0.01?o soluble protean). 

Localisation experiments have been performed on yeast cells 
carrying the above plasmids by sphaeroplasting the cells using standard 
techniques described by J.B.D. Beggs, Nature, 1978, 275 > ™5. ?eri- 
plasmic enzymes, localised outside of the cell membrane, were released 
when the cell wall was removed. The osmotic stabiliser (1.2K 
sorbitol) was then replaced by 0.1M Tris-HCl buffer, pE 7*3 containing 
0.2mK ZnCl 2 to lyse the sphaeroplasts and the whole centrifuged at 
100,000 x g for 1 hour to separate proteins in the soluble cytoplasmic 
fraction from membrane bound proteins. The results in Table 6 
demonstrate the presence of 64>o of the CPG^ activity in the peri- 
plasmic fraction and a further 16£o associated with the cell membrane 
fraction, 

TABLE 6 

Localisation of CPG^ in S. cerevisiae 

% CPG^ activity 
Periplasmic 64 
Cytoplasmic 20 
Membrane bound 16 



ff% I .IT * f)F% /* 
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JX/6425/Burope 

CLAIMS 

1. A recombinant DKA transfer vector comprising a leader 
sequence polynucleotide characterised in that the leader 
sequence polynucleotide codes for a signal polypeptide of 
formula I, 

5 Ket - Arg - Pro - Ser - lie - Eis - Arg - Thr - 

Ala - lie - Ala - Ala - Val - Leu - Ala - Thr - I 
Ala - Phe - Val - Ala - Gly - Thr 

2. A recombinant UNA transfer vector according to claim 1 
characterised in that the leader sequence polynucleotide is 

10 of formula II, 

5^- ATG .CGC CCA TCC ATC CAC CGO ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 
GCC TTC GTG GCG GGC ACC - 3 ^ 

3. A recombinant DIvA transfer vector according to either 
15 claim 1 or claim 2 characterised in that the leader sequence 

polynucleotide is downstream of and in reading phase with a 
bacterial or a yeast promoter and a prokaryotic ribosome 
binding site, 

4. A recombinant DIIA transfer vector according to any one 
20 of claims 1 to 3 characterised in that the leader sequence 

polynucleotide is upstream of and in reading phase vith a 
structural gene. 

5- A recombinant DKA transfer vector according to claim 4 
characterised in that the structural gene codes for human 
25 growth hormone, human insulin or human chorionic somato- 
mammotropin. 

6. A recombinant Dllk transfer vector according to claim 4 
characterised in that the structural gene codes for S.coli 
p - galactosidase. 

30 n 

7. A recombinant DkA transfer vector according to claim 4 
characterised in that the structural gene codes for 
Pseudomonas carboxypeptidase G 2 (CPG 2 ). 

8. A recombinant DHA transfer vector according to claim 7 
comprising a polynucleotide of formula 
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5 « - 

Ala 
GCC 



1 

Met 
ATG 

10 
lie 

ATC 



Arg 
CGC 

Ala 
GCC 



Pro 
CCA 

Ala 
GCC 



Ser 
TCC 

Val 
GTG 



lie 
ATC 

Leu 
CTG 



Hie 
CAC 

Ala 

GCC 



Arg 
CGC 

Thr 
ACC 



Thr 
ACA 

Ala 
GCC 



Phe 
TTC 

Lys 
AAG 

Ala 
GOT 

Lys 
AAG 

Glu 
GAG 

lie 
ATC 

Ala 
GCC 



Val 
GTG 

Arg 
CGC 

Thr 
ACC 

Thr 
ACG 

Thr 
ACC 

Ala 
GCC 

Glu 
GAG 



20 
Ala 

GCG 

Asp 
GAC 



Asp 
GAC 

Lett 
CTG 

Gly 
GGC 

Ala 

GCT 

Leu 
CTC 



Gly 
GGC 

30 
As ii 

AAC 



Glu 
GAG 

Glu 
GAG 

Thr 
ACC 

Ala 

GCG 

Lye 
AAG 



Thr 
ACC 

Val 
GTG 



40 
Gin 

CAG 



Lys 
AAG 

Gly 
GGT 

Gly 
GGC 

Asn 
AAC 



Ala 

GCC 

Leu 
CTG 



Pro 
CCG 

50 
Leu 

CTG 



Asp 
GAC 

Asn - 
AAC 

Leu 
CTC 



Leu 
CTG 

Phe 
TTC 



Ala 
GCC 

Val 
GTC 

60 
Ala 

GCC 



Phe 
TTC 

Gly 
GGC 



Ala 
GCC 

Gin 
CAG 

Val 
GTG 

Asn 
AAC 

Glu 
GAG 



70 
Leu 

CTC 



Phe 
TTC 



Gin 
CAG 

Ala 
GCA 



lie 
ATC 

lie 
ATC 

Gly 
GGC 

Glu 
GAG 

80 
Thr 

ACG 



Val 
GTC 

90 

Val 
GTG 



Thr 
ACG 

Val 
GTG 



Arg 
CGA 

Gly 
GGC 



Ser 
AGC 

Asp 

GAC 



Lys 
AAG 

Asn 
AAC 



Ser 
TCG 

lie 
ATC 



Ala 
GCC 

Val 
GTG 



Gly 
GGC 

Gly 
GGC 



Leu 
CTG 

Lys 
AAG 
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lie 
ATC 

Leu 
CTG 

Tyr 
TAG 



100 
Lys 

AAG 

Leu 
CTG 

Leu 
CTC 



Gly 
GGC 

110 
Met 

ATG 

Lys 
AAG 



Arg 
CGC 

Ser 

TCG 

120 
Gly 

GGC 



Gly 
GGC 

His 
CAC 

lie 
ATT 



Gly 
GGC 

Met 
ATG 

Leu 
CTC 



Lys 
AAG 

Asp 
GAC 

Ala 
GCG 



Asn Leu 
AAC CTG 



Thr 
ACC 

LyB 
AAG 



Yal 
GTC 

Ala 
GCC 



Pro 
CCG 



Phe 
TTC 



Arg 
CGC 



Val 

GTC 



130 
Glu 

GAA 



Gly 
GGC 



Asp 
GAC 



Lys Ala 
AAG GCC 



Tyr 
TAC 



Gly 
GGC 



Pro 
CCG 



Gly 
GGC 



lie 
ATC 



140 

Ala 

GCC 



Asp 
GAC 



Asp LyB 
GAC AAG 



Gly 
GGC 



Gly 
GGC 



Asn 
AAC 



Ala 
GCG 



Yal 
GTC 



lie 
ATC 



150 
Leu 

CTG 



His 
CAC 



Thr 
ACG 



Leu 
CTC 



Lys 
AAG 



Leu 
CTG 



Leu 
CTG 



Lys 
AAG 



Glu 
GAA 



Tyr 
TAC 



160 
Gly 

GGC 



Val 
GTG 



Arg 
CGC 



Asp 
GAC 



Tyr 
TAC 



Gly 
GGC 



Thr 
ACC 



lie 
ATC 



Thr 
ACC 



Yal 
GTG 



170 
Leu 

CTG, 



Phe 
TTC 



Asn 
AAC 



Thr 
ACC 



Asp 
GAC 



Glu 
GAG 



Glu 
GAA 



Lys 
AAG 



Gly 
GGT 



Ser 
TCC 



180 

Phe 

TTC 



Gly 
GGC 



Ser 
TCG 



Arg 
CGC 



Asp 
GAC 



Leu 
CTG 



lie 
ATC 



Gin 
CAG 



Glu 
GAA 



Glu 
GAA 



190 
Ala 

GCC 



Lys 
AAG 



Leu 
CTG 



Ala 
GCC 



Asp 
GAC 



Tyr 
TAC 



Yal 
GTG 



Leii 
CTC 



Ser 
TCC 



Phe 
TTC 



200 
Glu 

GAG 



Pro 
CCC 



Thr 
ACC 



Ser 
AGO 



Ala 
GCA 



Gly 
GGC 



Asp 
GAC 



Glu Lys Leu 

GAA AAA CTC 

lie Ala Tyr 

ATC GCC TAC 

Gly Ly 0 Ala 

GGC AAG GCC 

Pro Glu Leu 

CCC GAG CTG 

Glu Ala Ser 

GAG GCT TCC 

Ket Abii He 

ATG AAC ATC 

Leu Arg. Phe 

CTG CGC TTC 



28 
210 

Ser Leu Gly 

TCG CTG GGC 

220 

Yal Gin Yal 

GTG CAG GTC 

230 

Ser His Ala 

TCG CAT GCC 

Gly Yal Am 

GGC GTG AAC 

Asp Leu Yal 

GAC CTC GTG 

Asp Asp Lys 

GAC GAC AAG 

A en Trp Thr 

AAC TGG ACC 
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Thr Ser Gly 

ACC TCG GGC 

Asn lie Thr 

AAC ATC ACC 

Gly Ala Ala 

GGC GCC GCG 

240 

Ala Leu Yal 

GCG CTG GTC 

250 

Leu Arg Thr 

CTG CGC ACG 

260 

Ala Lys Asn 

GCG AAG AAC 

lie Ala Lys 

ATC GCC AAG 



270 
Ala 

GCC 



Gly 
GGC 



Asn 
AAC 



Yal 
GTC 



Ser 
TCG 



AAC 



lie 
ATC 



lie 
ATC 



Pro 
CCC 



Ala 
GCC 



280 
Ser 

AGO 



Ala 
GCC 



Thr 
ACG 



Leu 
CTG 



Asn 
AAC 



Ala 
GCC 



Asp 
GAC 



Yal 
GTG 



Arg 
CGC 



Tyr 
TAC 



290 
Ala 

GCG 



Arg 
CGC 



Asn 
AAC 



Glu 
GAG 



Asp 

GAC 



Phe 
TTC 



Asp 

GAC 



Ala 

GCC 



Ala 
GCC 



Met 
ATG 



500 
Lys 

AAG 



Thr 
ACG 



Leu 
CTG 



Glu 
GAA 



Glu 
GAG 



Arg 
CGC 



Ala 

GCG 



Gin 
CAG 



Gin 
CAG 



Lys 
AAG 



510 
AAG 



Leu 
CTG 



Pro 
CCC 



Glu 
GAG 



Ala 
GCC 



29 
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Asp 

GAC 



Val 
GTG 



Lys 
AAG 



Val 

GTG 



lie 
ATC 



320 
Val 

GTG 



Thr 
ACG 



Arg 
CGC 



Gly 
GGC 



Arg 
CGC 



Pro 
CCG 



Ala 
GCC 



Phe 
TTC 



Asn 
AAT 



Ala 
GCC 



330 

Gly 
GGC 



Glu 
GAA 



Gly 
GGC 



Gly 
GGC 



lys 
AAG 



Lya 
AAG 



Leu 
CTG 



Val 
GTC 



Asp 
GAC 



Lys 
AAG 



340 
Ala 

GCG 



Val 
GTG 



Ala 

GCC 



Tyr 
TAC 



Tyr 
TAC 



Lys 
AAG 



Glu 
GAA 



Ala 
GCC 



Gly 
GGC 



Gly 
GGC 



350 
Thr 

ACG 



Leu 
CTG 



Gly 
GGC 



Val 

GTG 



Glu 
GAA 



Glu 
GAG 



Arg 

CGC 



Thr 
ACC 



Gly 
GGC 



Gly 
GGC 



360 
Gly 

GGC 



Thr 
ACC 



Asp 
GAC 



Ala 
GCG 



Ala 
GCC 



Tyr 
TAC 



Ala 
GCC 



Ala 
GCG 



Leu 
CTC 



Ser 
TCA 



370 
Gly 

GGC 



Lys 
AAG 



Pro 
CCA 



Val 
GTG 



lie 
ATC 



Glu 
GAG 



Ser 
AGC 



Leu 
CTG 



Gly 
GGC 



Leu 
CTG 



380 
Pro 

CCG 



Gly 
GGC 



Phe 
TTC 



Gly 
GGC 



Tyr 
TAC 



His 
CAC 



Ser 
AGC 



Asp 

GAC 



Lys 
AAG 



Ala 
GCC 



390 
Glu 

GAG 



Tyr 
TAC 



Val 
GTG 



Asp 
GAC 



lie 
ATC 



Ser 
AGC 



Ala 

GCG 

Ala 

CGC 



lie 
ATT 

Arg 
CGC 



Pro 
CCG 

Leu 
CTG 



Arg 
CGC 

lie 
ATC 



400 
Arg 

CGC 



Met 
ATG 



Leu 
CTG 

410 
Asp 

GAT 



Tyr 
TAC 

Leu 
CTG 



Met 
ATG 

Gly 
GGC 



Ala 
GOT 

Ala 
GCC 



Gly 
GGC 



Lys 
AAG 



TGA - 3« 
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9« A recombinant UKA transfer vector according to any 
preceding claim characterised in that the transfer vector is a 
plasmid. 

10. A recomb inant UNA transfer vector according to claim 9 
whenever taken together with claim 7 having the designation 
pEHl, pKI-1111, pHK14s pHK21 f pEK22, pHK51 f pEK32 or pI2C3. 

11. A microorganism transformed hy a transfer vector 
characterised in that the transfer vector is a recombinant DKA 
transfer vector according "to claim 1. 

12. A microorganism according to claim 11 characterised in 
that the transfer vector is a recombinant USA transfer vector 
according to claim 4- 

13. A microorganism according to either claim 11 or claim 12 
which is a bacterium of the species E.coli, Pseudomonas or 
Bacillus or a yeast of the species Saccharomyces cerevisiae. 

14. A process for the preparation of a gene product characterised 
hy 

(a) culturing a microorganism according to claim 12 in a 
culture medium to produce the gene product in the culture 
medium or the periplastic space of the microorganism, and 

(b) isolating the gene product from the culture medium 
or the periplastic space of the microorganism. 

15. A process according to claim 14 characterised in that the 
gene product is Pseudomonas carb oxyp ept idas e or S.coli 

f± - galactosidase. 
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RESTRICTION ENZYME MAP OF pNM14 

Fig.3. 
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RESTRICTION ENZYME MAP OF pNM22 



Fig.5. 
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Description 

The present invention relates to fragments of specific deoxyri bo nucleotide sequences that promote the 
secretion of gene products from cells and in particular to recombinant DNA transfer vectors that contain 
5 these fragments. 

Recent developments in biochemistry have led to the construction of recombinant DNA transfer 
vectors in which, transfer vectors, for example plasmids, are made to contain exogeneous DNA. In some 
cases the recombinant incorporates heterologous DNA that codes for polypeptides that are ordinarily not 
produced by the organism susceptible to transformation by the recombinant vehicle. 
jo In its basic outline a method of endowing a microorganism with the ability to synthesise a new protein 
involves three general steps: 

(a) isolation and purification of the specific gene or nucleotide sequences containing the genetically 
coded information for the amino acid sequence of the desired protein or polypeptide, 

(b) recombination of the isolated gene or nucleotide sequence with an appropriate transfer vector, 
is typically DNA of a bacteriophage or plasmid to form a recombinant transfer vector that codes, in part, for 

the production of the desired protein or polypeptide, 

(c) transfer of the vector to the appropriate microorganism and selection of a strain of the recipient 
microorganism containing the desired genetic information. 

Provided the gene or nucleotide sequence expresses its protein or polypeptide in the chosen 
20 microorganism, growth of the microorganism should then produce the desired protein or polypeptide in 
significant quantities. 

Once the microorganism has been cultured, the protein or polypeptide must be isolated from the 
undesired materials. This step is considerably facilitated if the majority of the desired protein or 
polypeptide is present in the culture medium and/or the periplasmic space of the microorganism. In other 

25 words purification may be performed in a more efficient manner if, once expressed, the protein or 
polypeptide passes through the cell membrane and out of the. cytoplasm. 

The passage of the protein or polypeptide through the cell membrane is desirable for two main 
reasons. First the desired protein or polypeptide will generally be foreign to the microorganism in which it 
is expressed. In many cases, therefore, it will be quickly broken down by proteolytic enzymes etc. in the 

30 cells cytoplasm and will, subsequently, have a short half life within the cell. By transferring the protein or 
polypeptide out of the cytoplasm soon after expression the stability of the protein or polypeptide will be 
greatly increased. Second the number of unwanted genetic materials and products (from which the desired 
protein or polypeptide must be isolated) will be fangreater in the cell's cytoplasm than in the culture 
medium and/or in the cell's periplasmic space. It can be seen that on both of the above counts the transfer 

35 of the protein or polypeptide through the cell membrane and out of the cytoplasm will greatly facilitate 
protein or polypeptide isolation. 

One way in which the secretion of gene products from the cell's cytoplasm may be promoted is to 
produce, within the cytoplasm, a preprotein or prepolypeptide in which the desired protein or polypeptide 
is preceded by a signal polypeptide. The predominantly hydrophobic signal polypeptide directs the desired 

40 protein or polypeptide to the cell's periplasmic space, where the signal peptide is removed as the desired 
protein or polypeptide traverses the cell membrane. 

Many of the known signal peptides contain cysteine residues. These residues have been found to react 
in the cell membrane and thereby inhibit the efficient transfer of the desired gene product out of the cell. 
It is the primary object of the present invention to provide recombinant DNA transfer vectors 

45 containing a leader sequence polynucleotide that codes for a signal peptide that is cysteine free. Other 
objects and advantages of the present invention will become apparent from the following description 
thereof. 

According to the present invention there is provided a recombinant DNA transfer vector comprising a 
leader sequence polynucleotide coding for signal polypeptide of formula I, 

50 

Met-Arg-Pro-Ser-lle-His-Arg-Thr-Ala-lle-Ala- I 
Ala-Val-Leu-Ala-Thr-Ala-Phe-Val-Ala-Gly-Thr 

The transfer vector may be a bacteriophage or, which is preferred, a plasmid. 
55 Preferably the majority of the codons in the nucleotide sequence are those preferred for the expression 
of microbial genomes. Suitable codons are listed in UK 1,568,047 and UK 2007675 A, and these publications 
are therefore incorporated herein by reference. 

In one preferred embodiment of the present transfer vector the nucleotide sequence has formula II 

60 5'— ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 

GCC TTC GTG GCG GGC ACC —3' 

The nucleotide sequence coding for the signal polypeptide (the leader sequence polynucleotide) will 
65 preferably be downstream of and in reading phase with a bacterial or a yeast promoter and a prokaryotic 
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ribosome binding site in the transfer vector. Moreover the leader sequence polynucleotide will either be 
upstream of an insertion site for a structural gene or, which is preferred, will be upstream of and in reading 
phase with a structural gene coding for a desired protein or polypeptide. Preferably the gene codes for a 
eukaryotic, particularly a mammalian, protein or polypeptide. 

5 The structural gene may code for such prokaryotic proteins as E. coli p-galactosidase or Pseudomonas 
carboxy peptidase G 2 (CPG 2 ) (Carboxypeptidase G 2 is an enzyme, produced by Pseudomonas species 
strain RS-16, that has application in cancer chemotherapy. It is a Zn 2+ containing dimer of 2x42,000 daltons 
and has high affinities (Km values of 10~ 5 or 10~ 6 M) for both 5-methyltetrahydrofolate, the predominant 
circulatory form of folate in mammals and for the folic acid antagonist methotrexate (MTX), which is widely 

w used in cancer chemotherapy. The enzyme may be used directly for the plasma depletion of reduced 
folates, essential as co-factors in purine and particularly in pyrimidine biosynthesis. CPG 2 has been shown 
to inhibit the development of the Walker 256 carcinoma in vivo and to remove MTX from circulation in 
patients where prolonged exposure to high doses of MTX leads to toxicity). 

Examples of transfer vectors according to the present invention that code for CPG 2 are pNM1 , pNM1 1 1 , 

is pNM14, pNM21, pNM22, pNM31, pNM32 and pLEC3. 

The promoter is preferably a high expression bacterial or yeast promoter for the structural gene in a 
variety of hosts. The particular choice of promoter will depend on the microorganism to be transformed. 
For example the transformation of E. coli will generally be effected by a transfer vector in which an E. coli 
promoter controls the expression of the structural gene. Examples of E. coli promoters are those present in 

20 the plasmids pBR 322 and pAT 1 53. By contrast, the transformation of Pseudomonas species will generally 
be effected by a transfer vector in which a Pseudomonas promoter controls the expression of the structural 
gene. Examples of Pseudomonas promoters are those present in the plasmid pKT 230 or Pseudomonas 
chromosomal DIM A. 

In order to express the structural gene the present transfer vector will be transformed into a suitable 

25 microorganism. According to a further aspect of the present invention therefore there is provided a 
microorganism transformed by a recombinant DNA transfer vector according to this invention. The 
microorganism will preferably be a bacterium or yeast in which high expression of the structural gene, 
within the transfer vector, occurs. Depending on the choice of promoter the microorganism may be a strain 
chosen from one of the following bacteria E. coli, Pseudomonas or the yeast Saccharomyces cerevisiae. 

30 Having transformed the microorganism, the protein or polypeptide, for which the structural gene 
codes, may then be expressed by culturing the transformed microorganism in a culture medium. It is the 
primary advantage of the present invention that culturing the transformed microorganism affords a 
preprotein or prepolypeptide in which the desired protein or polypeptide is preceded by the present signal 
polypeptide. This means that soon after expression the signal polypeptide directs the desired protein or 

35 polypeptide to the cell's periplasmic space, where the signal polypeptide is removed as the desired protein 
or polypeptide traverses the cell membrane. Since the present signal polypeptide is free of cysteine 
residues the desired gene product will be efficiently secreted through the membrane. 

The present transfer vectors may be prepared by any of the methods that are well known in the 
recombinant DNA art For example the leader sequence polynucleotide may be synthesised by the 

40 modified triester method of K. Itakura et al, JACS, 1975, 97, 7327 or by the improved oligodeoxynucleotide 
preparation described in UK 2007675A. The disclosure of both of these references is incorporated herein by 
reference. The synthesised polynucleotide may then be inserted in a transfer vector, preferably a plasmid. 
In the transfer vector it will preferably be downstream of and in reading phase with a bacterial or a yeast 
promoter and a prokaryotic ribosome binding site. The leader sequence polynucleotide should also be 

45 either upstream of a structural gene insertion site or upstream of and in reading phase with a structural 
gene. 

Alternatively, DNA fragments containing the leader sequence polynucleotide may be obtained from 
natural sources, In particular from the chromosomal DNA of Pseudomonas species strain RS-16. In this 
particular case a polynucleotide (formula II above) coding for the present signal polypeptide immediately 
so precedes a structural gene coding for CPG 2 . A number of the DNA fragments containing this leader 
sequence polynucleotide may therefore be recognised by their ability, on insertion into a plasmid and 
transformation of a microorganism by the resultant recombinant vector, to enable a microorganism to 
grow on folate. Examples of such recombinant transfer vectors that contain both a polynucleotide coding 
for the present signal polypeptide (formula li above) and a structural gene coding for CPG 2 are pNM1, 
55 pNM111, pNM14, pNM21, pNM22, pNM31, pNM32 and pLEC3. Of course, once a For recombinant vector 
has been obtained in this way it may be subcloned to afford alternative vectors (either For or Fol") that 
also contain a polynucleotide coding for the present signal polypeptide. 

Once a suitable DNA fragment has been isolated it may then be inserted in a transfer vector, preferably 
a plasmid. In the transfer vector the leader sequence polynucleotide on the inserted fragment should be 
60 downstream of and in reading phase with a bacterial or a yeast promoter and a prokaryotic ribosome 
binding site. The leader sequence polynucleotide should also be either upstream of a structural gene 
insertion site or upstream of and in reading phase with a structural gene. 

The structural gene for insertion downstream of and in reading phase with the present leader sequence 
polynucleotide may be obtained, for example, by the synthetic methods mentioned above (this is 
65 particularly useful for the preparation of genes coding for small proteins). Alternatively the structural gene 
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may be prepared from m=RNA by the use of the enzyme reverse transcriptase or may be isolated from 
natural sources (chromosomal DNA). 

An example of the latter method is the isolation of DNA fragments containing a polynucleotide 
sequence (shown in Table 1) coding for the enzyme CPG 2 (amino acid sequence also shown in Table 1) 

5 from Pseudomonas species strain RS-16 chromosomal DNA. Examples of piasmids containing a CPG 2 
structural gene, as well as a polynucleotide coding for the present signal polypeptide (formula II above), are 
pNM1, pNM111, pNM14, pNM21, pNM22 f pNM31, pNM32 and pLEC3. 

Once prepared or isolated the leader sequence polynucleotide and the structural gene will be inserted 
into a transfer vector, preferably a plasmid, to form a recombinant DNA transfer vector according to the 

'0 present invention. The insertion step or steps will preferably be effected by one of the well known 
techniques in this art that employ restriction endonucleases, see for example the methods discussed in UK 
2090600A, the disclosure of which is incorporated herein by reference. The choice of transfer vector will be 
determined by the microorganism in which the leader sequence polynucleotide and structural gene are to 
be expressed. Generally the transfer vector will be a cloning vehicle that is suitable for transforming the 

is chosen microorganisms and that displays a phenotypical characteristic, such as antibiotic resistance, by 
which the recombinant transfer vectors may be selected. Thus, if the microorganism is to be E. coli, then 
suitable transfer vectors will be the E. coli piasmids pBR322 and pAT153. Alternatively, if the 
microorganism is to be Pseudomonas then a suitable transfer vector will be Pseudomonas pkT230. 



TABLE 1 

A polynucleotide sequence, coding for CPG 2 , isolated from 
Pseudomonas species strain RS-16 chromosomal DNA 



25 


5' — 


1 

Met 
ATG 


Arg 


Pro 

LUA 


Ser 


He 

ATP 


His 

PAP 


Arg 
CGC 


Thr 
ACA 


30 


At. 

Ala 
GCC 


10 
ue 

ATCC 


Ala 

Mia 
GCC 


Ala 

Mid 

GCC 


v di 

GTG 


Leu 
CTG 


Ala 
GCC 


Thr 
ACC 


Ala 
GCC 


35 


Phe 
TTC 


Val 
GTG 


20 

Ala 

GCG 


Gly 
GGC 


Thr 
ACC 


Ala 
GCC 


Leu 
CTG 


Ala 
GCC 


Gin 
CAG 


Lys 
AAG 


Arg 
CGC 


Asp 
GAC 


30 

Asn 

AAC 


Val 
GTG 


Leu 
CTG 


Phe 
TTC 


Gin 
CAG 


Ala 
GCA 


40 


Ala 
GCT 


Thr 
ACC 


Asp 
GAC 


Glu 
GAG 


40 

Gin 

CAG 


Pro 
CCG 


Ala 
GCC 


Val 
GTG 


He ' 
ATC 


45 


Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAG 


Lys 
AAG 


50 

Leu 

CTG 


Val 
GTC 


Asn 
AAC 


He 
ATC 


50 


Glu 
GAG 


Thr 
ACC 


Gfy 
GGC 


Thr 

ACC 


'Gly 
GGT 


Asp 
GAC 


60 

Ala 

GCC 


Glu 

GAG * 


Gly 
GGC 


55 


He 
ATC 


Ala 
GCC 


Ala 
GCT 


Ala 
GCG 


Gly 
GGC 


Asn 
AAC 


Phe 
TTC 


70 

Leu 

CTC 


Glu 
GAG 


Ala 
GCC 


Glu 
GAG 


Leu 
CTC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTC 


Gly 
GGC 


Phe 
TTC 


80 

Thr 

ACG 


60 


Val 
GTC 


Thr 
ACG 


Arg 
CGA 


Ser 
AGC 


Lys 
AAG 


Ser 
TCG 


Ala 
GCC 


Gly 
GGC 


Leu 
CTG 


65 


90 
Val 
GTG 


Val 
GTG 


Gly 
GGC 


Asp 
GAC 


Asn 
AAC 


lie 
ATC 


Val 
GTG 


Gly 
GGC 


Lys 
AAG 
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TABLE 1 (contd.) 



100 





tie 
ATC 


Lys 
AAG 


Gly 
GGC 


Arg 
CGC 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTG 


5 


Leu 
CTG 


Leu 
CTG 


110 

Met 
ATG 


Ser 
TCG 


His 
CAC 


Met ■» 
ATG 


Asp 
GAC 


Thr 
ACC 


Val 
GTC 


JO 


Tyr 
TAC 


Leu 
CTC 


Lys 
AAG 


120 
Gly 
GGC 


lie 
ATT 


Leu 
CTC 


Ala 
GCG 


Lys 
AAG 


Ala 
GCC 


15 


Pro 
CCG 


Phe 
TTC 


Arg 
CGC 


Bal 
GTC 


130 
•Glu 
GAA 


Gly 
GGC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


20 


Tyr 
TAC 


Gly 
GGC 


Pro 
CCG 


Gly 
GGC 


lie 
ATC 


140 
Ala 
GCC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


25 


Gly 
GGC 


Gly 
GGC 


Asn 
AAC 


Ala 
GCG 


Val 
GTC 


lie 
ATC 


150 
Leu 
CTG 


His 
CAC 


Thr 
ACG 


Leu 
CTC 


Lys 
AAG 


Leu 
CTG 


Leu 
CTG 


Lys 
AAG 


Glu 
GAA 


Tyr 
TAC 


160 
Gly 
GGC 


Val 
GTG 


30 


Arg 
CGC 


Asp 
GAC 


Tyr 
TAC 


Gly 
GGC 


Thr 
ACC 


ire 

ATC 


Thr 
ACC 


Val 
GTG 


170 
Leu 
CTG 


35 


Phe 
TTC 


. Asn 
AAC 


Tbr 
ACC 


Asp 
GAC 


Glu 
GAG 


Glu 
GAA 


Lys 
AAG 


Gly 
GGT 


Ser 
TCC 


40 


180 
Phe 
TTC 


Gly 
GGC 


Ser 
TCG 


Arg 
CGC 


Asp 
GAC 


Leu 
CTG 


He 
ATC 


Gin 
CAG 


Glu 
GAA 


Glu 
GAA 


190 
Ala 
GCC 


Lys 
AAG 


Leu 
CTG 


Ala 
GCC 


Asp 
GAC 


Tyr 
TAC 


Val 
CTG 


Leu 
CTC 


45 


* 

Ser 
TCC 


Phe 
TTC 


200 
Glu 
GAG 


Pro 
CCC 


Thr 
ACC 


Ser 
AGC 


Ala 
GCA 


Gly 
GGC 


Asp 
GAC 


50 


Glu 
GAA 


Lys 
AAA 


Leu 

CTC 


210 
Ser 
TCG 


Leu 
CTG 


Gly 
GGC 


Thr 

ACC 


Ser 
TCG 


Gly 
GGC 


55 


lie 
ATC 


Ala 
GCC 


Tyr 
TAC 


Val 
GTG 


220 
Gin 
CAG 


Val 
GTC 


Asn 
AAC 


He 
ATC 


Thr 

ACG 


60 


Gly 
GGC 


Lys 
AAG 


Ala 
GCC 


Ser 
TCG 


His 
CAT 


230 
Ala 
GCC 


Gly 
GGC 


Ala 
GCC 


Ala 
GCG 


Pro 
CCC 


Glu 
GAG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Asn 
AAC 


240 
Ala 
GCG 


Leu 
CTG 


Val 
GTC 
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TABLE 1 (contd.) 





Glu 
GAG 


Ala 
GCT 


Ser 
TCC 


Asp 
GAC 


Leu 
CTC 


Val 
GTG 


Leu 
CTG 


250 
Arg 
CGC 


Thr 
ACG 


5 


Met 
ATG 


Asn 
AAC 


lie 
ATC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCG 


Lys 
AAG 


260 
Asn 
AAC 


JO 


Leu 
CTG 


Arg 
CGC 


Phe 
TTC 


Asn 
AAC 


Trp 
TGG 


Thr 
ACC 


lie 

ATC 


Ala 
GCC 


Lys 
AAG 



15 . 


270 
Ala 
GCC 


Gly 
GGC 


Asn 
AAC 


Val 
GTC 


Ser 
TCG 


Asn 
AAC 


He 
ATC 


lie 
ATC 


Pro 
CCC 


20 


Ala 
GCC 


280 
Ser 
AGC 


Ala 
GCC 


Thr 
ACG 


Leu 
CTG 


Asn 
AAC 


Ala 
GCC 


Asp 
GAC 


Val 
GTG 


Arg 
CGC 


Tyr 
TAC 


290 
Ala 
GCG 


Arg 
CGC 


Asn 
AAC 


Glu 
GAG 


Asp 
GAC 


Phe 
TTC 


Asp 
GAC 


25 


Ala 
GCC 


Ala 
GCC 


Met 
ATG 


300 
Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


30 


Ala 

GCG * 


Gin 
CAG 


Gin 
CAG 


Lys 
AAG 


310 
Lys 
AAG 


Leu 

CTG 


Pro 
CCC 


Glu 
GAG 


Ala 
GCC 


35 


Asp 
GAC 


Val 
GTG 


Lys 
AAG 


Val 
GTG 


lie 

ATC 


320 
Val 
GTC 


Thr 
ACG 


Arg 
CGC 


Gly 
GGC 


40 


Arg 
CGC 


Pro 
CCG 


Ala 
GCC 


Phe 
TTC 


Asn 
AAT 


Ala 

GCC 


330 
Gly 
GGC 


Glu 
GAA 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Lys 
AAG 


Leu 
CTG 


Val 
GTC 


Asp 
GAC 


Lys 
AAG 


340 
Ala 
GCG 


Val 
GTG 


45 


Ala 
GCC 


Tyr 
TAC 


Tyr 
TAC 


Lys 
AAG 


Glu 
GAA 


Ala 
GCC 


Gly 
GGC 


Gly 
GGC 


350 
Thr 
ACG 


50 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


Thr 
ACC 


Gly 
GGC 


Gly 
GGC 


55 


360 
Gly 
GGC 


Thr 
ACC 


Asp 
GAC 


Ala 
GCG 


Ala 
GCC 


Tyr 
TAC 


Ala 
GCC 


Ala 
GCG 


Leu 
CTC 


Ser 
TCA 


370 
Gly 
GGC 


Lys 
AAG 


Pro 
CCA 


Val 
GTG 


lie 
ATC 


Glu 
GAG 


Ser 
AGC 


Leu 
CTG 


60 


Gly 
GGC 


Leu 
CTG 


380 
Pro 
CCG 


Gly 
GGC 


Phe 
TTC 


Gly 
GGC 


Tyr 
TAC 


His 
CAC 


Ser 
AGC 
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Asp 
GAC 


Lys 
AAG 


Ala 
GCC 
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TABLE 1 (contd.) 

390 

Glu Tyr Val 
GAG TAC GTG 


Asp 
GAC 


He 
ATC 


Ser 
AGC 


Ala 
Mia 

GCG 


Hp 
lie 

ATT 


Prn 

CCG 


Arg 
CGC 


400 
Arg 
CGC 

• 


Leu 
CTG 


Tvr 

i yi 

TAC 


Met 
ATG 


Afa 
GCT 


Ala 
CGC 


Arg 
CGC 


Leu 
CTG 


lie 
ATC 


Met 
ATG 


410 
Asp 
GAT 


Leu 
CTG 


Gly 
GGC 


Ala 
GCC 


Gly 
GGC 


Lys 
AAG 


TGA 


—3' 













Amino acids 1 to 22 are the present signal polypeptide. 
Amino acids 23 to 415 are the CPG 2 structural gene. 

NB The leader sequence polynucleotide is the preferred polynucleotide of formula II. 

20 

The present recombinant DNA transfer vectors, microorganisms transformed by the present 
recombinant DNA transfer vectors and processes for the preparation of said vectors and microorganisms 
will now be described by way of example only, with particular reference to the Figures in which: 
Figure 1 is a restriction enzyme cleavage site map of pNM1, 
25 Figure 2 is a restriction enzyme cleavage site map of pNM111 f 
Figure 3 is a restriction enzyme cleavage site map of pNM14, 
Figure 4 is a restriction enzyme cleavage site map of pNM21, 
Figure 5 is a restriction enzyme cleavage site map of pNM22, and 

Rgure 6 illustrates the process for the preparation of a recombinant plasmid containing both the 
30 present leader sequence polynucleotide and the (3-Galactosidase structural gene, and 
Figure 7 is a restriction enzyme cleavage site map of pL£C3. 

Materials and methods 
Bacterial strains and plasmids 
35 The bacterial strains used were Escherichia coli W5445 {pro leu thi tht° sup E44 lac Y ton A r" m" Stf) 
Pseudomonas putida 2440 (r~) and Pseudomonas sp strain RS-16. The plasmids employed were pBR322 (F. 
Bolivar et al Gene, 1977, 2, 95), pAT153 (A. J. Twigg et al, Nature, 1980, 283, 216) and pKT230 (M. 
Bagdasarin et al, Gene 1981, 16. 237) and pROG5 (R. F. Sherwood et al, The Molecular Biology of Yeast, 
1979 Cold Spring Harbor Publications). 

40 

Media and culture conditions 

£ coli was routinely cultured in L-broth (1% tryptone, 0.5% yeast extract, 0.5% NaCI). Solidified 

medium (L-agar) consisted of L-broth with the addition of 2% (w/v) agar (Bacto-Difco). Antibiotic 

concentrations used for the selection of transformants were 50 ug/ml ampicillin, 15 ug/ml tetracycline and 
45 30 ug/ml kanamycin. In the case of E. coli these were conducted in 2YT liquid medium (1.6% tryptone, 1% 

yeast extract, 0.5% NaCI) containing 1% glucose, and 0.05% folate where appropriate. The pseudomonads 

were grown in a minimal salts solution consisting of per litre: MgS0 4 , 0.05 g; CaCI 2 , 2H 2 0, 0.05 g; 

FeS0 4 -7H 2 0, 0.005 g; MnSO*, 0.0016 g; Na 2 Mbo4, 2H 2 0, 0.0015 g; KH 2 P0 4 ; 5 g; K 2 HP0 4 :3H 2 0, 12 g; 

glutamate, 10 g. The minimal medium employed for E. coli was M9 medium (J. Miller, Experiments in 
so molecular genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1972). 

Purification of DNA 

Plasmids were purified from chloramphenicol amplified cultures {D. B. Clewell, J. Bacteriol, 1972, 110, 
667) by Brij-iysis (D. B. Clewell et al, Proc. Natl. Acad. Sci., USA, 1969, 62, 1159) and subsequent caesium 
55 chloride-ethidium bromide density gradient centrifugation (A. Colman et al, Eur. J. Biochem, 1978, 91, 303). 
A rapid, small scale plasmid isolation technique (Birnboim et al. Nucl. Acids Res. 1979, 7, 1513) was also 
employed for screening purposes. Chromosomal DNA from the donor Pseudomonas strain (RS-16) was 
prepared essentially as described by J. Marmar, J. Mol. Biol, 1961, 3, 208. 

eo Restriction, ligation and transformation methods 

Restriction endonucleases and DNA iigase were purchased from Bethesda Research Laboratories and 
used in the buffers and under the conditions recommended by the supplier. Transformation of E coli was 
essentially as described by S. N. Cohen et al.. Proa Natl. Acad. Sci., USA, 1972, 69, 2110, while Ps.putida 
was transformed by the method of M. Bagdasarian and K. N. Timmis, Current Topics in Microbiology and 

65 Immunology, Eds. P. H. Hofschnelder and W. Goebel, Springer Verlag, Berlin, 1981, p. 47. 
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Agarose gel electrophoresis 

Digests were electrophoresed in 0.8% agarose slab gels (10 cmx20 cmxO.5 cm) on a standard vertical 
system (Raven), employing Tris-borate-EDTA buffer. Electrophoresis of undigested DNA was at 125V, 50 
mAfor 3 hours, while digested DNA was electrophoresed at 15V, 10 mAfor 16 hours. Fragment sizes were 
5 estimated by comparison with fragments of MDNA digested with Hind\\\ and ADNA cut with both Hind\\\ and 
EcoBL Fragments were isolated from gels using electroelution (M. W. McDonnell et al, Proc. Natl. Acad. Sci, 
USA, 1977, 74, 4835). 

Determination of carboxy peptidase G 2 activity 

w Bacteria were grown in 1 litre batch culture and 100 ml samples taken at various stages in the growth 
phase. Samples were cooled on ice, centrifuged at 13,000xg for 10 minutes and resuspended and frozen in 
5 ml of 0.1 M Tris HCI, pH 7.3 containing 0.2 mM ZnS0 4 . The cells were disrupted using a MSE Ultrasonic 
Disintegrator (150 W) at medium frequency, amplitude 2, for three 30-second intervals on ice. Cell debris 
was removed by centrifugation at 10,000xg for 5 minutes. CPG 2 activity was determined after J. L. 

15 McCullough et al, J. Biol. Chem, 1971, 246, 7207. A 1 ml reaction cuvette containing 0.9 mi of 0.1 M Tris-HCI, 
pH 7.3 plus 0.2 mM ZnS0 4 and 0.1 ml of 0.6 mM methotrexate was equilibrated at 37°C. Enzyme extract was 
added to the test cuvette and the decrease in absorbance at 320 nm measured using a Pye-Unicam SP1800 
double-beam spectrophotometer. Enzyme activity per ml extract was calculated as A 320 nm absorbance/ 
min divided by 8.3, which is equivalent to the hydrolysis of 1 nmol of MTX/min at 37°C. Protein 

20 concentration was determined by the method of M. M. Bradford, Anal. Biochem., 1976, 72, 248. 

Cell fractionation techniques 

Bacterial cultures were grown in the low phosphate medium of H. C. Neu and L A. Heppel, (J. Biol. 
Chem., 1964, 240, 3685), supplemented with 100 ug/ml ampicillin, to an OD 450 =1.0. 40 ml of culture was 

25 centrifuged at 5000 g for 10 min, washed in 5 ml of 10 mM Tris-HCI pH 7.0, and resuspended in 0.9 ml of 
0.58 M sucrose, 0.2 mM DTT, 30 mM Tris-HCI pH 8.0. Conversion to spheroplasts was achieved by the 
addition of 20 ul of lysozyme (2 mg/ml), 40 pi 0.1 M EDTA, and incubation at 23° for 10 min (H. C. Neu et al, 
J. Biol. Chem., 1964, 233, 3893). The spheroplasts were placed on ice and 0.1 ml of 30% (w/v) BSA added, 
followed by 5 ml of sucrose-tris buffer. Sedimentation of the spheroplasts was achieved by centrifugation 

3 o at 5000 g for 10 min and the supernatant retained as the 'periplasmic' fraction. The pellet was resuspended 
in 5 mM 0 mM Tris-HCI, 0.2 mM DTT pH 7.0 and sonicated at 20 Kc/sec, 2 Amps for 15 sec. Remaining whole 
cells were removed by centrifugation at 1000xg for 10 min. Centrifugation at 100000xg for 1 hr, at 4°C, 
separated the soluble (cytoplasmic) proteins from the particulate (membrane-bound) proteins. The 
membrane pellet was resuspended in 1 ml of 10 mM Tris-HCI, 0.2 mM DTT, pH 7.0. 

35 CPG 2 was assayed as described. Alkaline phosphatase was assayed according to J. Miller, Experiments 
in Molecular genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1972, NADH oxidase 
according to M. J. Osborn et al, J. Biol. Chem., 1972, 247, 3962 and glyceraldehyde-3-phosphate 
dehydrogenase after K. Suzuki et al, FEMS, 1971, 13, 217. 

40 Example 1 

Preparation of recombinant plasmid pNM1 (A plasmid containing both the present leader sequence 
polynucleotide and the CPG 2 structural gene). 

To isolate the gene for carboxy peptidase G 2 together with the leader sequence polynucleotide 
chromosomal DNA prepared from the Pseudomonas host (strain RS-16) was partially digested with Sau3A 
45 and fragments of between 6—8 Md isolated from agarose gels by electroelution. The 'sized' DNA was 
ligated with alkaline phosphatase treated BamH\ cut pBR322, transformed into £. cod W5445, and Ap r 
transformants selected. Of the 3,500 Ap r colonies obtained, approximately 70% were Tc s . Utilisation of a 
rapid plasmid isolation technique on 50 Ap r Tc a transformants demonstrated that 90% of the gene bank 
harboured plasmids of the expected size. As a further check on the authenticity of the gene bank, the 
* so individual clones were screened for the acquisition of a Leu + phenotype. Two such clones were identified. 
Both carried a plasmid capable of transforming feu& (B-isopropylmalate dehydrogenase) £ co/i mutants to 
prototrophy. 

Acquisition of a functional CPG 2 gene should enable E. co/i to utilise folic acid as a carbon source. The 
2,400 gene bank clones were screened for the ability to grow on minimal medium containing folate as the 
55 sole source of carbon (i.e. Fol + ). A single Fol + clone was detected and shown to harbour a plasmid capable 
of transforming plasmid-minus W5445 to the Fol + phenotype. Classical restriction mapping of this plasmid 
(pNMD was undertaken which revealed the presence of a 5.9 Md insert of pseudomonad DNA within 
pBR322. The restriction enzyme cleavage site map of pNM1 is given in Figure 1 . The nucleotide sequence of 
the leader sequence polynucleotide and the CPG 2 structural gene is given in Table 1. 

60 

Example 2 

Subcloning of plasmid pNM1 to form pNM111 

In order to pinpoint the position of the CPG 2 gene and the leader sequence polynucleotide within the 
5.9 Md insert, subcloning of various restriction enzyme fragments, into pBR322, was undertaken. A 
65 functional CPG 2 gene was shown not to occur on Xhol or Sphl fragments of the pNM1 insert, but was 
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present ona3.1 MdflglH f rag ment. This latter fragment was cloned into the BamW\ site of pBR322 to give 
pNM11 (6.0 Md). A further reduction in the size of pNM11 was achieved by digesting with Sa11 and 
relegating the resultant fragment to yield pNM1 1 1 . In addition, plasmids in which the smaller 0.95 Md Sal 1 
fragment had become inserted in the opposite orientation to the parent plasmid (pNM1 1 ) were For. Taken 
5 together these subcloning results indicate that the CPG 2 gene and the leader sequence polynucleotide lie 
between the Bg111 site at 4.14 and the SA1 1 site at 6.03 on Pnmi. Furthermore, the gene contains a Sph\ 
(5.17), Sail (5.07) and at least one Xho\ (4.56 and/or 5.56) site. The restriction enzyme cleavage site map of 
pNM111 is given in Rgure 2. 

w Example 3 

Preparation of recombinant plasmid pNM14. (A plasmid containing both the present leader sequence 
polynucleotide and the CPG 2 structural gene) 

The 3.1 Md Bgl II fragment from Example 2 above was partially digested with SauZk. These fragments 
were then cloned into the Bam HI site of pAT153 and transformed into £ coli W5445. Of the two 

j s Ap r Tc s For colonies obtained, one contained a plasmid which had acquired an extra Saf I and Bam HI site, 
this was pNM14. The restriction enzyme cleavage site map of pNM14 is given in Figure 3. Sequencing of 
the leader sequence polynucleotide and the CPG 2 structural gene present in pNM14 gave the nucleotide 
structure shown in Table 1. DNA sequencing of pNM14 also revealed that the Sa/I — Bam HI fragment was a 
duplication of a segment of DNA from within the insert (marked * on Figure 3) composed of two contiguous 

20 Sau 3 A fragments. 

Example 4 and 5 

Preparation of recombinant plasmids pNM21 and pNM22 (Plasmids containing both the present leader 
sequence polynucleotide and the CPG 2 structural gene) 

25 The 3.1 Md Bgl II fragment from Example 2 was cloned into the Bam HI site of pAT 1 53 and transformed 
into E. coli W5445. Two Ap r Tc s FoP colonies were obtained, one containing a plasmid pNM21 in which the 
fragment was inserted in the opposite orientation to pNM1 and one containing a plasmid pNM22 in which 
the fragment was inserted in the same orientation as pNM1. The restriction enzyme cleavage site maps of 
pNM21 and pNM22 are given in Figures 4 and 5 respectively. 

30 The two plasmids, pNM21 and pNM22 both transformed E colito For, indicating that a pseudomonad 
promoter was present on the 3.1 Md fragment. However, cells carrying the plasmid pNM21, in which the 
Bg\\\ fragment was cloned in the opposite orientation to pNM1, exhibited more rapid growth with folic acid 
as the sole carbon source. This difference was clearly visible on agar medium, where colonies developed 
concentric yellow "halos" of precipitated pteroic acid, the insoluble product of folate hydrolysis. 

35 Confirmation that pNM21 gave enhanced expression of CPG 2 over pNM22, was obtained by assaying 
enzyme production during batch growth of cells containing either plasmid. (The ceils were grown in 
complex medium supplemented with 1% (w/v) glucose and where appropriate 0.05% (w/v) folic acid. The 
generation time was 56 — 66 min. The culture was sampled at hourly intervals and whole cells were 
disrupted by sonication. Enzyme activity was determined in the centrifugal extract). Results are given in 

40 Table 2. 

The expression of CPG 2 from the plasmids pNM22 and pNM1 was 2.5 units/litre of culture, 
representing 0.005% soluble protein. In contrast, expression frompNM21 was 3000—3500 units/litre of 
culture, which represented 4.7% soluble protein. As the cloned gene is inserted into the BamHl site of 
pAT153, the observed higher expression of pNM21 is almost certainly due to transcriptional read through 

45 from the Tc promoter. The low expression of CPG 2 carried on plasmids pNM1 and pNM22 is consistent with 
the view that Pseudomonas promoters function poorly in E coli. It is also apparent from Table 2 that in the 
presence of folate there is a two-fold increase in the specific activity of enzyme measured in cell sonicates. 
This phenomenon has been observed in all experiments, but does not seem to be associated with classical 
induction of the CPG 2 gene, as overall enzyme yield in the presence or absence of folate remains at about 

so 3000 u/litre culture. It in fact reflects a consistent depression in the soluble protein levels measured in 
sonicates from cells grown in the presence of folate. There is no obvious difference in growth rate of cells 
grown with folate and the reasons for this result are not clear. 
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TABLE 2 

Carboxypeptidase G production by E. coii W5445, containing the plasmids pNM1, pNM21 and pNM22 

Carboxypeptidase G 2 specific activity 
Cu,ture . (U/MG soluble protein) 



30 



60 



pNM1 pNM22 pNM21 





Age 
(hr) 


-Fol 


+Fol 


-Fol 


+Fol 


-Fol 


+Fol 


10 


1 










11.5 


13.4 




2 










12.9 


9.6 


75 


3 


.008 


.005 


.010 


.019 


13.9 


23.3 




4 


.009 


.011 


.015 


.013 


12.3 


26.9 




5 


.007 


.019 


.016 


.016 


11.5 


25.6 


20 


6 


.005 


.024 


.014 


.023 


13.7 


24.1 




7 


.015 


.029 


.024 


.043 


13.2 


20.6 


25 


8 


.013 


.028 


,024 


.046 


13.0 


23.6 



Expression of the cloned gene in Ps. putida 

The observation that the CPG 2 gene was expressed in £ coii regardless of the orientation of the gene 
within the vector suggested that the promoter region of the CPG 2 gene had been cloned with the structural 
gene and the leader sequence polynucleotide. The low expression of CPG 2 within £ co//*rom its natural 
promoter (pNM1, pNM22, pNM1 11) confirmed other findings that Pseudomonas promoters are poorly 
recognised by £ coli RNA polymerases. It would be expected that if the gene was introduced back into a 
pseudomonad cellular environment then improved expression from the Pseudomonas promoter should 
35 result The 3.1 Md Bg\\\ fragment was subcloned into the Pseudomonas cloning vector pKT230 at its single 
BamH\ site. Two plasmids were obtained, pNM3.1 and pNM32 representing the two possible orientations of 
the cloned gene. These plasmids were transformed into Ps. putida 2440 by the method of Bagdasarian and 
Timmis. Pseudomonas cells carrying both plasmids were cultured in minimal salts medium and enzyme 
production monitored. 

40 Yields of 500—1000 units/litre of culture were obtained regardless of gene orientation within the 
plasmid. Specific activity of the enzyme in cell sonicates was 1.5 to 4.0 U/mg protein representing 0.3 to 
0.7% soluble protein (compared with <0.05% soluble protein in the donor strain RS-16). This result 
strongly indicates that the CPG 2 promoter is present and operated in a pseudomonad background. When 
the same plasmids were transformed into £ coii W5445 12—40 Units/litre were found at specific activity 

45 <0.07 U/mg (<0.01% soluble protein). 

Periplasmic localisation of CPG 2 

There is evidence that CPG 2 is located in or near the periplasmic space of Pseudomonas strain RS-16. 
Pteroic acid, the product of CPG 2 hydrolysis of folic acid is extremely insoluble and is found predminantly 
so outside the cell in both liquid and solid media. Exogeneous pteroic acid is also seen in £ coii cultures 
containing the cloned gene when folic acid is present in the medium. This is clearly demonstrated by the 
"halo" of precipitated pteroic acid observed around colonies carrying plasmids in which expression of 
CPG 2 is from the Tc promoter of pBR322 (e.g. pNM21). 

The localisation of CPG 2 produced by £ coii cells carrying pNM21 was examined by the separation of 
55 cellular proteins into cytoplasmic, periplasmic, and whole membrane fractions. As a control, levels of three 
marker enzymes, alkaline phosphatase (periplasmic), glyceraldehyde-3-phosphate dehydrogenase 
(cytoplasmic) and NADH • 0 2 oxidoreductase (membrane-bound), were also determined. As can be seen 
from Table 3 97% of the CPG 2 activity occurs in the periplasm, equivalent to the marker periplasmic 
enzyme, alkaline phosphatase. This confirms the presence in pNM21 of a leader sequence polynucleotide 
next to the CPG 2 gene that codes for a signal polypeptide according to this invention that promotes the 
secretion of CPG 2 from the cytoplasm into the periplasmic space. 

Carboxypeptidase G 2 synthesised in £ coli 

The specific activity of CPG 2 in crude cell extracts of cells carrying pNM21 was 50-fold higher than 
55 equivalent extracts from Pseudomonas strain RS-16. To determine whether the cloned gene=product in £ 
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coli had the same properties as CPG 2 from the pseudomonad, enzyme was purified from £ coli carrying 
pNM21. The specific activity of purified CPG 2 (single band SDS-PAGE) was 535 U/mg of protein, which 
compares to 550 U/mg of protein from the pseudomonad. CPG 2 purified from £ coli clone pNM21 
co-chromatographed with CPG 2 from Pseudomonas strain RS-16 at a sub-unit molecular weight value of 
5 42,000 daitons. Km values using methotrexate as substrate were 7.4x1 0~ 6 M and 8.0xl0~ 6 M respectively. 
In addition, antiserum raised against the Pseudomonas enzyme indicated immunological identity between 
the £ coli and Pseudomonas CPG 2 , as a confluent precipitation line was formed on Ouchterlony double 
diffusion analysis. 

70 TABLE 3 

Localisation of carboxypeptidase 



Enzyme activity 



Fraction 


CPG 2 


AP 


GAPDH 


NADHOX 


Periplasmic 


97.0 


97.1 


6.8 


0.25 


Cytoplasmic 


2.6 


2.3 


93 


8.4 


Membrane-bound 


0.4 


0.6 


0.2 


89.1 



AP=Alkaline phosphatase 
25 GAPDH=Glyceraldehyde-3-phosphate dehydrogenase 
NADHOX=NADH • 0 2 oxidoreductase 

Example 6 

Preparation of a recombinant plasmid containing both the present leader sequence polynucleotide and the 

30 P-Galactosidase structural gene 

Plasmid pNM14 (Example 3) was treated with Sau 3A (GATC) and the fragments were cloYied into the 
Bam HI site of M13 mp7 template DNA (single stranded DNA (Step A of Figure" 6). The product carrying a 
318bp Sau 3A fragment coding for the present signal polypeptide and the first 22 amino acids of CPG 2 
(nucleotide sequence of this fragment shown in Table 4) was selected and made double stranded. The DNA 

35 coding for the signal polypeptide (and the first part of CPG 2 ) was then excised as an EcoRI fragment This 
- fcoRI fragment was then cloned into the promoter cloning vector E. coli pMC1403 (M. J, Casadaban eta/, J. 
Bacterid, 1980, 143 f 971), which carries only the structural gene (lac Z) for p-galactosidase (i.e. no promoter 
and no ATG start codon) (Steps B and C of Figure 6). Plasmids were obtained in which the EcoRI fragment 
had inserted in both orientations, however, only those in which fusion of the CPG 2 sequence to the 

40 p-galactosidase sequence had occurred (i) yielded a 0.34 Kb fragment upon digestion with BamH\; (ii) 
enabled the host cell to hydrolyse the colourless lactose analogue, BCIG, and impart a blue colouration to 
colonies. The 0.34 Kb BamH\ fragment has been recloned into M13mp7 and sequenced to confirm that 
fusion has occurred. The 'precursor' fusion produced will consist of the signal peptide, the first 22 amino 
acids of CPG 2 , 6 amino acids derived from the M13mp7 and pMC1403 linker units, and-p-gaiactosidase 

45 from its 8th amino acid onward. 

Localisation experiments have been performed on cells carrying a plasmid coding for the 'fusion gene' 
where the cellular proteins have been fractionated into periplasmic, cytoplasmic and membrane fractions. 
In these experiments an organism (E. coli MC 1061) which is deleted for the lac Z gene was grown in 
phosphate medium (H. C. Neu et al, J. Biol. Chem., 1964, 240, 3685) and periplasmic enzymes were released 

so from the harvested cells by conversion to spheroplasts. Separation of soluble proteins (cytoplasmic) from 
particulate proteins (membrane band) was achieved by sonicating the harvested spheroplasts and 
subsequent centrifugation at 100,000 g for 1 hr, to sediment the cell membrane (T. J. Silhary et al, Proc. 
Natl. Acad. Sci. USA, 1976, 73, 3423). 

The results given in Table 5 demonstrate the presence of 50% of the p-galactosidase activity in the 

55 periplasmic space. This result is in direct contrast to similar work involving fusion of other periplasmic 
protein signal sequences to P-galactosidase, where the fusion proteins are not exported, but become 
jammed in the membrane (P. J. Bassford et al, J. Bacteriol, 1979, 139, 19 and S. D. Emr et al, J. Cell, Biol., 
1980, 86, 701). 

60 
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TABLE 4 

The polynucleotide sequence of the 318 bp Sau 3A fragment from 
recombinant plasmid pNM14 



5' — G 


ATC 


CAC 


GCA 


CTG 


AAG 


GCG 


CGC 


GGC 


AAG 


ACG 


CGC 


GGC 


GTG 


GCG 


ACG 


CTG 


TGC 


ATC 


GGC 


GGG 


GGC 


GAA 


GGC 


ACC 


GCA 


GTG 


GCA 


CTC 


GAT 


TGC 


TAT 


AAG 


AAC 


CAT 


GGC 


Tftn 

1 VJ VJ 


GGA 


CGC 


CCG 


ACA 


ACA 


GGC 


GTC 


CAC 


CAG 


CTT 


TTT 


TCA 


TTC 


CGA 


CAA 


CCC 


GAA 


CGA 


ACA 


ATG 


CGT 


AGA 


GCA 


GGA 


GAT 


TCC 




Met 
ATG 


Arg 
CGC 


Pro 

CCA 


Ser 
TCC 


lie 
ATC 


His 
CAC 


Arg 
CGC 


Thr 

i nr 

ACA 


Ala 
GCC 


He 
ATC 


Ala 
GCC 


Ala 

GCC 


Val 
GTG 


Leu 
CTG 


Ala 
GCC 


Thr 
ACC 


Ala 
GCC 


Phe 
TTC 


Val 
GTG 


Ala 
GCG 


Gly 
GGC 


Thr 
ACC 


Ala 
GCC 


Leu 

CTG 


Ala 
GCC 


Gin 
CAG 


Lys 
AAG 


Arg 
CGC 


Asp 
GAC 


Asn 
AAC 


Val 
GTG 


Leu 
•CTG 


Phe 
TTC 


Gin 
CAG 


Ala 
GCA 


Ala 
GCT 


Thr 

ACC 


Asp 
GAC 


Glu - 
GAG 


Gin 
CAG 


Pro 

CCG 


Ala 
GCC 


Val 
GTG 


He 
ATC 



NB. This fragment carries the leader sequence coding for the signal polypeptide, a part of the CPG 2 
structural gene coding for the first 22 amino acids of the protein, the ATG start codon, the CPG 2 ribosome 
binding site (AGGA) and other components of the CPG 2 promoter region. 

TABLE 5 

Localisation of signal peptide— p-galactosidase fusion protein 



% Localisation 3 





CPGa/B-gal 


AP 


GAPDH 


NADHOX 


Periplasmic 


50.3 


97.3 


3.4 


0.4 


Cytoplasmic 


30.9 


2.5 


95.3 


8.2 


Membrane-bound 


18.8 


0.2 


1.3 


89.4 



a -average results from 4 experiments 

CPGa/B-gal=Carboxypeptidase G 2 -p-galactosidase fusion protein 
AP=Alkaline phosphatase 

GAPDH=Glyceraldehyde-3-phosphate dehydrogenase 
NADHOX=NADH • 0 2 Oxidoreductase 

Example 7 

Preparation of a recombinant plasmid, containing both the present leader sequence polynucleotide and the 
CPG 2 structural gene, able to replicate in £ coli and 5. serevisiae 

A 2.03 kilobase BamH\ fragment coding for the present signal polypeptide and the entire CPG 2 
molecule was cloned in both orientations into the BamH\ site of an E coii/S. cerevis/ae shuttle vector 
pR0G5 (R. F. Sherwood and R. K. Gibson, The Molecular Biology of Yeast 1979, Cold Spring Harbor 
Publications) to give plasmids pLEC3 and pLEC4 (Figure 7). These plasmids were transformed into S. 
cerevisiae strain LL20 by the lithium acetate induced transformation method described by Ito etaL, J. Bact, 
1983, 753, 163. Yields equivalent to 10—20 units/litre of culture volume were obtained regardless of gene 
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orientation within the plasmid. Specific activity of the enzyme in total cell extracts was 0.2—0.3 u/mg 
protein representing 0.005% soluble protein. This level of expression from the pseudomonad promoter in a 
yeast background is similar to the level found when the gene was reading from its own promoter in E. coli 
(0.01% soluble protein). 

5 Localisation experiments have been performed on yeast cells carrying the above plasmids by 
sphaeroplasting the cells using standard techniques described by J. B. D. Beggs, Nature, 1978, 275, 105. 
Periplasmic enzymes, localised outside of the cell membrane, were released when the cell wall was 
removed. The osmotic stabiliser (1.2M sorbitol) was then replaced by 0.1 M Tris-HCI buffer, pH 7.3 
containing 0.2 mM ZnCI 2 to lyse the sphaeroplasts and the whole centrifuged at 100,000xg for 1 hour to 

to separate proteins in the soluble cytoplasmic fraction from membrane bound proteins. The results in Table 
6 demonstrate the presence of 64% of the CPG 2 activity in the periplasmic fraction and a further 16% 
associated with the cell membrane fraction. 

TABLE 6 

15 Localisation of CPG 2 in S. cerevisiae 

% CPG 2 activity 



Periplasmic 64 

20 

Cytoplasmic 20 
Membrane bound 16 

25 Claims 

1. A recombinant DNA transfer vector comprising a leader sequence polynucleotide downstream of 
and in reading phase with a bacterial or yeast promoter and a prokaryotic ribosome binding site and 
upstream of and in reading phase with a structural gene, characterised in that the leader sequence 

30 polynucleotide codes for a signal polypeptide of formula I 

Met-Arg-Pro-Ser-lle-His-Arg-Thr- 

Ala-lle-Ala-Ala-Val-Leu-Ala-Thr- I 
Ala-Phe-Val-Ala-Gly-Thr 

35 

2. A recombinant DNA transfer vector according to claim 1 characterised in that the leader sequence 
polynucleotide is of formula II 

5'__ ATG CGC CCA TCC ATC CAC CGC ACA 

4Q GCC ATC GCC GCC GTG CTG GCC ACC (I 

GCC TTC GTG GCG GGC ACC —3'. 

3. A recombinant DNA transfer vector according to either claim 1 or claim 2 characterised in that the 
structural gene codes for Pseudomonas carboxy peptidase G 2 (CPG 2 ). 

45 4. A recombinant DNA transfer vector according to either claim 1 or claim 2 characterised in that the 
structural gene codes for a protein or polypeptide other than Pseudomonas carboxypeptidase G 2 . 

5. A recombinant DNA transfer vector according to claim 4 characterised in that the structural gene 
codes for a prokaryotic protein other than Pseudomonas carboxypeptidase G 2 . 

6. A recombinant DNA transfer vector according to claim 5 characterised in that the structural gene- 
50 codes for E. Coli p-galactosidase. 

7. A recombinant DNA transfer vector according to claim 3 comprising a polynucleotide of formula 



1 





Met 


Arg 


Pro 


Ser 


He 


His 


Arg 


Thr 


5'— 


ATG 


CGC 


CCA 


TCC 


ATC 


CAC* 


CGC 


ACA 




10 
















Ala 


lie 


Ala 


Ala 


Val 


Leu 


Ala 


Thr 


Ala 


GCC 


ATC . 


GCC 


GCC 


GTG 


"CTG 


GCC 


ACC 


GCC 






20 














Phe 


Val 


Ala 


Gly 


Thr 


Ala 


Leu 


Ala 


Gin 


TTC 


GTG 


GCG 


GGC 


ACC 


GCC 


CTG 


GCC 


CAG 
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Lys 
AAG 


Arg 
CGC 


Asp 
GAC 


30 

Asn 

AAC 


Val 
GTG 


Leu 
CTG 


Phe 
TTC 


Gin 
CAG 


Ala 
GCA 


5 


Ala 
GCT 


Thr 
ACC 


Asp 
GAC 


Glu 
GAG 


40 

Gin 

CAG 


Pro 
CCG 


Ala 
GCC 


Val 
GTG 


lie 
ATC 


W 


Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAG 


Lys 
AAG 


50 

Leu 

CTG 


Val 
GTC 


Asn 
AAC 


He 
ATC 


15 


Glu 
GAG 


Thr 
ACC 


Gly 
GGC 


Thr 
ACC 


Gly 
GGT 


Asp 
GAC 


60 

Ala 

GCC 


Glu 
GAG 


Gly 
GGC 


20 


He 
ATC 


Ala 

GCC 


Ala 
GCT 


Ala 
GCG 


Gly 
GGC 


Asn 
AAC 


Phe 
TTC 


70 

Leu 

CTC 


Glu 
GAG 


Ala 
GCC 


Glu 
GAG 


Leu 
CTC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTC 


Gly 
GGC 


Phe 
TTC 


80 

Thr 

ACG 


25 


Val 
GTC 


Thr 
ACG 


Arg 
CGA 


Ser 
AGC 


Lys 
AAG 


Ser 
TCG 


Ala 
GCC 


Gly 
GGC 


Leu 
CTG 


30 


90 

Val 

GTG 


Val 
GTG 


Gly 
GGC 


Asp 
GAC 


Asn 
AAC 


lie 
ATC 


Val 
GTG 


Gly 
GGC 


Lys 
AAG 


35 


He 
ATC 


100 

Lfs 
AAG 


Gly 
GGC 


Arg 
CGC 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTG 


Leu 
CTG 


Leu 
CTG 


110 

Met 
ATG 


Ser 
TCG 


His 
CAC 


Met 
ATG 


Asp 
GAC 


Thr 
ACC 


Val 
GTC 


40 


Tyr 
TAC 


Leu 
CTC 


Lys 
AAG 


120 
Gly 
GGC 


He 
ATT 


Leu 
CTC 


Ala 
GCG 


Lys 
AAG 


Ala 
GCC 


45 


Pro 
CCG 


Phe 
TTC 


Arg 
CGC 


Val 
GTC 


130 
Glu 
GAA 


Gly 
GGC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


SO 


Tyr 
TAC 


Gly 
GGC 


Pro 
CCG 


Gly 
GGC 


lie 
ATC 


140 
. Ala 
GCC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


55 


Gly 
GGC 


Gly 
GGC 


Asn 
AAC 


Ala 
GCG 


Val 
GTC 


He * 
ATC 


150 
Leu 
CTG 


His 
CAC 


Thr 
-ACG 


Leu 
CTC 


Lys 
AAG 


Leu 
CTG 


Leu 
CTG 


Lys 
AAG 


Glu 
GAA 


Tyr 
TAC 


160 
Gly 
GGC 


Val 
GTG 


60 


Arg 
CGC 


Asp 
GAC 


Tyr 
TAC 


Gly 
GGC 


Thr 
ACC 


He 
ATC 


Thr 
ACC 


Val 
GTG 


170 
Leu 
CTG 


65 


Phe 
TTC 


Asn 
AAC 


Thr 

ACC 


Asp 
GAC 


Glu 
GAG 


Glu 
GAA 


Lys 
AAG 


Gly 
GGT 


Ser 
TCC 
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180 





Phe 
TTC 


Gly 
GGC 


Ser 
TCG 


Arg 
CGC 


Asp 
GAC 


Leu 
CTG 


lie 
ATC 


Gin 
CAG 


Glu 
GAA 


5 . 


Glu 
GAA 


190 
Ala 
GCC 


Lys 
AAG 


Leu 
CTG 


Ala 
GCC 


Asp 
GAC 


Tyr 
TAC 


Val 
GTG 


Leu 
CTC 


JO 


Ser 
TCC 


Phe 
TTC 


200 
Glu 
GAG 


Pro 
CCC 


Thr 
ACC 


Ser 
AGC 


Ala 
GCA 


Gly 
GGC 


Asp 
GAC 


15 


Glu 
GAA 


Lys 
AAA 


Leu 
CTC 


210 
Ser 
TCG 


Leu 
CTG 


Gly 
GGC 


Thr 
ACC 


Ser 
TCG 


Gly 
GGC 


20 


lie 
ATC 


Ala 
GCC 


Tyr 
TAC 


Val 
GTG 


220 
Gin 
CAG 


Val 
GTC 


Asn 
AAC 


lie 
ATC 


Thr 
ACC 


Gly 
GGC 


Lys 
AAG 


Ala 
GCC 


Ser 
TCG 


His 
CAT 


230 
Ala 
GCC 


Gly 
GGC 


Ala 
GCC 


Ala 
GCG 


25 


Pro 
CCC 


Glu 
GAG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Asn 
AAC 


240 
Ala 
GCG 


Leu 
CTG 


Val 
GTC 


30 


Glu 
GAG 


Ala 
GCT 


Ser 
TCC 


Asp 
GAC 


Leu 
CTC 


Val 
GTG 


Leu 
CTG 


250 
Arg 
CGC 


Thr 

ACG 


35 


Met 
ATG 


Asn 
AAC 


He 
ATC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCG 


Lys 
AAG 


260 
Asn 
AAC 




Leu 
CTG 


Arg 
CGC 


Phe 
TTC 


Asn 
AAC 


Trp 
TGG 


Thr 
ACC 


Me 
ATC 


Ala 
GCC 


Lys 
AAG 


40 


270 
Ala 
GCC 


Gly 
GGC 


Asn 
AAC 


Val 
GTC 


Ser 
TCG 


Asn 
AAC 


He 
ATC 


lie 
ATC 


Pro 

CCC 


45 


Ala 
GCC 


280 
Ser 
AGC 


Ala 
GCC 


Thr 
ACG 


Leu 
CTG 


Asn 
AAC 


Ala 
GCC 


Asp 
GAC 


Val 
GTG 


50 


Arg 

CGC 


Tyr 
TAC 


290 
Ala 
GCG 


Arg 
CGC 


Asn 
AAC 


Glu 
GAG 


Asp 
GAC 


Phe 
TTC 


Asp 
GAC 


55 


Ala 
GCC 


Ala 
GCC 


Met 
ATG 


300 
Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


Ala 
GCG 


Gin 
CAG 


Gin 
CAG 


Lys 
AAG 


310 
Lys 
AAG 


Leu 
CTG 


Pro 
CCC 


Glu 
GAG 


Ala 
GCC 


60 


Asp 
GAC 


Val 
GTG 


Lys 
AAG 


Val 
GTG 


lie 

ATC 


320 
Val 
GTC 


Thr 
ACG 


Arg 
CGC 


Gly 
GGC 
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330 



Arg 
CGC 


Pro 

CCG 


Ala 
GCC 


Phe 
TTC 


Asn 
AAT 


Ala 
GCC 


Gly 
GGC 


Glu 
GAA 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Lys 
AAG 


Leu 
CTG 


Val 
GTC 


Asp 
GAC 


Lys 
AAG 


340 
Ala 
GCG 


Val 
GTG 


Ala 
GCC 


Tyr 
TAC 


Tyr 
TAC 


Lys 
AAG 


Glu 
GAA 


Ala 
GCC 


Gly 
GGC 


Gly 
GGC 


350 
Thr 
ACG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


Thr 
ACC 


Gly 
GGC 


Gly 
GGC 


360 
Gly 
GGC 


Thr 

ACC 


Asp 
GAC 


Ala 
GCG 


Ala 
GCC 


Tyr 
TAC 


Ala 
GCC 


Ala 
GCG 


Leu 
CTC 


Ser 
TCA 


370 
Gly 
GGC 


Lys 
AAG 


Pro 
CCA 


Val 
GTG 


He 
ATC 


Glu 
GAG 


Ser 
AGC 


Leu 

. CTG 


Gly 
GGC 


Leu 
CTG 


380 
Pro 
CCG 


Gly 
GGC 


Phe 
TTC 


Gly 
GGC 


Tyr 
TAC 


His 
CAC 


Ser 
AGC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


390 
Glu 
GAG 


Tyr 
TAC 


Val 
GTG 


Asp 
GAC 


He 
ATC 


Ser 
AGC 


Ala 
GCG 


lie 
ATT 


Pro 
CCG 


Arg 
CGC 


400 
Arg 
. CGC 


Leu 
CTG 


Tyr 
TAC 


Met 
ATG 


Ala 
GCT 


Ala 
CGC 


Arg 
CGC 


Leu 
CTG 


He 
ATC 


Met 
ATG 


410 
Asp 
GAT 


Leu 
CTG 


Gly 
GGC 


Ala" 
GCC 


Gly 
GGC 


Lys 
AAG 


TGA 


—3' 













8. A recombinant DNA transfer vector according to any preceding claim characterised in that the 
transfer vector is a pfasmid. 

9. A microorganism transformed by a transfer vector characterised in that the transfer vector is a 
recombinant DNA transfer vector according to any one of the preceding claims 1 to 8. 

10. A microorganism according to claim 9 which is a bacterium of the species E. coli or Pseudomonas 
or a yeast of the species Saccharomyces cerevisiae. 

11. A process for the preparation of a gene product characterised by 

(a) culturing a microorganism according to either claim 9 or claim 10 in a culture medium to produce 
the gene product in the culture medium or the periplasmic space of the microorganism, and 

(b) isolating the gene product from the culture medium or the periplasmic space of the microorganism. 

12. A process according to claim 11 characterised in that the gene product is Pseudomonas 
carboxy peptidase G 2 . 

13. A process according to claim 1 1 characterised in that the gene product is a protein or polypeptide 
other than Pseudomonas carboxypeptidase G 2 . 

14. A process according to claim 13 characterised in that the gene product is a prokaryotic protein other 
than Pseudomonas carboxypeptidase G 2 . 

15. A process according to claim 14 characterised in that the gene product is E. coli (5-galactosidase. 

PatentansprGche 

1. rtekombinanter DNA-Transfervektor, enthaltend ein Leitsequenz-polynukleotid stromabwarts von 
und in Lesephase mit einem Bakterien- oder Hefepromotor und einer prokaryotischen Ribosomen- 
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Bindestelle und stromaufwarts von und in Lesephase mit einem Strukturgen, dadurch gekennzeichnet, daR 
das Leitsequenz-polynukleotid fur ein Signalpolypeptid der Formel I codiert 

Met-Arg-Pro-Ser-lle-His-Arg-Thr- 
5 Ala-lle-Ala-Ala-Val-Leu-AIa-Thr- I 

Ala-Phe-Val-Ala-Giy-Thr. 

2. Rekombinanter DNA-Transfervektor nach Anspruch 1, dadurch gekennzeichnet, daft das 
Leitsequenz-polynukleotid die Formel II aufweist 

JO 

5 '_ ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 

GCC TTC GTG GCG GGC ACC —3'. 

75 3. Rekombinanter DNA-Transfervektor nach Anspruch 1 oder 2, dadurch gekennzeichnet, daB das 
Strukturgen fur Pseudomonas-Carboxypeptidase G 2 (CPG 2 ) codiert. 

4. Rekombinanter DNA-Transfervektor nach Anspruch 1 oder 2, dadurch gekennzeichnet, daB das 
Sturkturgen fur ein anderes Protein oder Polypeptid codiert als Pseudomonas-Carboxypeptidase G 2 . 

5. Rekombinanter DNA-Transfervektor nach Anspruch 4, dadurch gekennzeichnet, daS das Strukturgen 
20 fur ein anderes prokaryotisches Protein als Pseudomonas-Carboxypeptidase G 2 codiert. 

6. Rekombinanter DNA-Transfervektor nach Anspruch 5, dadurch gekennzeichnet, dafc das Strukturgen 
fur (3-Galactosidase aus E. coli codiert. 

7. Rekombinanter DNA-Transfervektor nach Anspruch 3, enthaltend ein Polynukieotid der Formel 

25 1 



50 



60 





Met 


Arg 


rTO 


oer 


lie 


nib 


Arg 


Thr 

i nr 


o — 


Al o 


PC2P 




jrr 
1 UL» 


ATP 


PAP 


Pf5P 


APA 




10 
















Ala 


lie 


Ala 


Ala 


Val 


Leu 


Ala 


Thr 


Ala 


GCC 


ATC 


GCC 


GCC 


GTG 


CTG 


GCC 


ACC 


GCC 






20 














Phe 


Val 


Ala 


Gly 


Thr 


Ala 


Leu 


Ala 


Gin 


TTC 


GTG 


GCG 


GGC 


ACC 


GCC 


CTG 


GCC 


CAG 








30 












Lys 


Arg 


Asp 


Asn 


Vai 


Leu 


Phe 


Gin 


Ala 


AAG 


CGC 


GAC 


AAC 


GTG 


CTG 


TTC 


CAG 


GCA 










40 










Ala 


Thr 


Asp 


Glu 


Gin 


Pro 


Ala 


Val 


He 


GCT 


ACC 


GAC 


GAG 


CAG 


CCG 


GCC 


GTG 


ATC 












50 








Lys 


Thr 


Leu 


Glu 


Lys 


Leu 


Val 


Asn 


He 


AAG 


ACG 


CTG 


GAG 


AAG 


CTG 


GTC 


AAC 


ATC 














60 






Glu 


Thr 


Gly 


Thr 


Gly 


Asp 


Ala 


Glu 


Gly 


GAG 


ACC 


GGC 


ACC 


GGT 


GAC 


GCC 


GAG 


GGC 
















70 




lie 


Ala 


Ala 


Ala 


Gly 


Asn 


Phe 


Leu 


Glu 


ATC 


GCC 


GCT 


GCG 


GGC 


AAC 


TTC 


CTC 


GAG 


















80 


Ala 


Glu 


Leu 


Lys 


Asn 


Leu 


Gly 


Phe 


Thr 


GCC 


GAG 


CTC 


AAG 


AAC 


CTC 


GGC 


TTC 


ACG 


Val 


Thr 


Arg 


Ser 


Lys 


Ser 


Ala 


Gly 


Leu 


GTC 


ACG 


CGA 


AGC 


AAG 


TCG 


GCC 


GGC 


CTG 
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90 





Val 
GTG 


Val 
GTG 


Gly 
GGC 


Asp 
GAC 


Asn- 
AAC 


lie 
ATC 


Val 
GTG 


Gly 
GGC 


Lys 
AAG 


5 


lie 
ATC 


100 
Lys 
AAG 


Gly 
GGC 


Arg 
CGC 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTG 


10 


Leu 
CTG 


Leu 
CTG 


110 
Met 
ATG 


Ser 
TCG 


His 
CAC 


Met 
ATG 


Asp 
GAC 


Thr 
ACC 


Val 
GTC 


15 


Tyr 
TAC 


Leu 
CTC 


Lys 
AAG 


120 
Gly 
GGC 


He 
ATT 


Leu 
CTC 


Ala 
GCG 


Lys 
AAG 


Ala 
GCC 




Pro 

CCG 


Phe 
TTC 


Arg 
CGC 


Val 
GTC 


130 
Glu 
GAA 


Gly 
GGC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


20 


Tyr 
TAC 


Gly 
GGC 


Pro 
CCG 


Gly 
GGC 


lie 
ATC 


140 
Ala 
GCC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


25 


Gly 
GGC 


Gly 
GGC 


Asn 
AAC 


Ala 
GCG 


Val 
GTC 


He 
ATC 


150 
Leu 
CTG 


His 
CAC 


Thr 
ACG 


30 


Leu 
CTC 


Lys 
AAG 


Leu 
CTG 


Leu 
CTG 


Lys 
AAG 


Glu 
GAA 


Tyr 
TAC 


160 
Gly 
GGC 


Val 
GTG 


35 


Arg 
CGC 


Asp 
GAC 


Tyr 
TAC 


Gly 
GGC 


Thr 
ACC 


He 
ATC 


Thr 
ACC 


Val 
GTG 


170 
Leu 
CTG 




Phe 
TTC 


Asn 
AAC 


Thr 

ACC 


Asp 
GAC 


Glu 
GAG 


Glu 
GAA 


Lys 
AAG 


Gly 
GGT 


Ser 
TCC 


40 , 


180 
Phe 
TTC 


Gly 
GGC 


Ser 
TCG 


Arg 
CGC 


Asp 
GAC 


Leu 
CTG 


He 
ATC 


Gin 
CAG 


Glu 
GAA 


45 


Giu 
GAA 


190 
Ala 
GCC 


Lys 
AAG 


Leu 
CTG 


Ala 
GCC 


Asp 
GAC 


Tyr 
TAC 


Val 
GTG 


Leu 
CTC 


50 


Ser 
TCC 


Phe 
TTC 


200 
Giu 
GAG 


Pro 

CCC 


Thr 

ACC 


Ser 
AGC 


Ala 
GCA 


Gly 
GGC 


Asp 
GAC 




Glu 
GAA 


Lys 
AAA 


Leu 
CTC 


210 
Ser 
TCG 


Leu 
CTG 


Gly 
GGC 


Thr 
ACC 


Ser 
TCG 


Gly 
GGC 


55 


lie 

ATC 


Ala 
GCC 


Tyr 
TAC 


Val 
GTG 


220 
Gin 
CAG 


Val 
GTC 


Asn 
AAC 


He 
ATC 


Thr 
ACC 


60 


Gly 
GGC 


Lys 
AAG 


Ala 
GCC 


Ser 
TCG 


His 
CAT 


230 
Ala 
GCC 


Gly 
GGC 


Ala 
GCC 


Ala 
GCG 



65 
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Pro 
CCC 


Glu 
GAG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Asn 
AAC 


5 


Glu 
GAG 


Ala 
GCT 


Ser 
TCC 


Asp 
GAC 


Leu 
CTC 


Val 
GTG 


10 


Met 
ATG 


Asn 
AAC 


He 
ATC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 




Leu 
CTG 


Arg 
CGC 


Phe 
TTC 


Asn 
AAC 


Trp 
TGG 


Thr 
ACC 


15 


270 
Ala 
GCC 


Gly 
GGC 


Asn 
AAC 


Val 
GTC 


Ser 
TCG 


Asn 
AAC 


20 


Ala 
GCC 


280 
Ser 
AGC 


Ala 
GCC 


Thr 
ACG 


Leu 
CTG 


Asn 
AAC 


25 


Arg 
CGC 


Tyr 
TAC 


290 
Ala 
GCG 


Arg 
CGC 


Asn 
AAC 


Glu 
GAG 


30 


Ala 
GCC 


Ala 

GCC 


Met 
ATG 


300 
Lys 
AAG 


Thr 
ACG 


Leu 
CTG 




Ala 
GCG 


Gin 
CAG 


Gin 
CAG 


Lys 
AAG 


310 
Lys 
AAG 


Leu 
CTG 


35 


Asp 
GAC 


Val 
GTG 


Lys 
AAG 


Val 
GTG 


He 
ATC 


Val 
GTC 


40 


Arg 
CGC 


Pro 
CCG 


Ala 
GCC 


Phe 
TTC 


Asn 
AAT 


Ala 
GCC 


45 


Gly 
GGC 


Lys 
AAG 


Lys 
AAG 


Leu 
CTG 


Val 
GTC 


Asp 
GAC 


50 


Ala 
GCC 


Tyr 
TAC 


Tyr 
TAC 


Lys 
AAG 


Glu 
GAA 


Ala 
GCC 




Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


55 


360 
Gly 
GGC 


Thr 
ACC 


Asp 
GAC 


Ala 
GCG 


Ala 
GCC 


Tyr 
TAC 


60 


Ser 
TCA 


370 
Gly 
GGC 


Lys 
AAG 


Pro 
CCA 


Val 
GTG 


He 
ATC 


65 


Gly 
GGC 


Leu 
CTG 


380 
Pro 
CCG 


Gly 
GGC 


Phe 
TTC 


Gly 
GGC 



19 



Ala 
GCG 


Leu 
CTG 


Val 
GTC 


Leu 
CTG 


250 
Arg 
CGC 


Thr 

ACG 


Ala 
GCG 


Lys 
AAG 


260 
Asn 
AAC 


lie 
ATC 


Ala 
GCC 


Lys 
AAG 


He 
ATC 


lie 
ATC 


Pro 
CCC 


Ala 
GCC 


Asp 
GAC 


Val 
GTG 


Asp 
GAC 


Phe 
TTC 


Asp 
GAC 


Glu 
GAA 


Glu 
GAG 


Arg 
CGC 


Pro 
CCC 


Glu 
GAG 


Ala 
GCC 


Thr 
ACG 


Arg 
CGC 


Gly 
GGC 


330 
Gly 
GGC 


Glu 
GAA 


Gly 
GGC 


Lys 
AAG 


340 
Ala 
GCG 


Val 
GTG 


Gly 
GGC 


Gly 
GGC 


350 
Thr 
ACG 


Thr 
ACC 


Gly 
GGC 


Gly 
GGC 


Ala 
GCC 


Ala 
GCG 


Leu 
CTC 


Glu 
GAG 


Ser 
AGC 


Leu 
CTG 


Tyr 
TAC 


His 
CAC 


Ser 
AGC 
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390 



10 



15 



20 



25 



Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


Glu 
GAG 


Tyr 
TAC 


Val 
GTG 


Asp 
GAC 


lie 
ATC 


Ser 
AGC 


Ala 
GCG 


lie 
ATT 


Pro 
CCG 


Arg 
CGC 


400 
Arg 
CGC 


Leu 
CTG 


Tyr 
TAC 


Met 
ATG 


Ala 
GCT 


Ala 
CGC 


Arg 
CGC 


Leu 
CTG 


lie 
ATC 


Met 
ATG 


410 
Asp 
GAT 


Leu 
CTG 


Gly . 
GGC 


Ala 
GCC 


Gly 
GGC 


Lys 
AAG 


TGA 


— 3' 













8. Rekombinanter DNA-Transfervektor nach einem der vorstehenden Anspruche, dadurch 
gekennzeichnet, daB der Transfervektor ein Plasmid ist. 

9. Durch einen Transfervektor transformierter Mikroorganisrnus, dadurch gekennzeichnet, daB der 
Transfervektor ein rekombinanter DNA-Transfervektor nach einem der Anspruche 1 bis 8 ist. 

10. Mikroorganisrnus nach Anspruch 9, der ein Bakterium der Gattung E. coli Oder Pseudomonas oder 
eine Hefe der Gattung Saccharomyces cerevisiae ist. 

11. Verfahren zur Herstellung eines Genprodukts, gekennzeichnet durch 

(a) Zuchten eines Mikroorganisrnus nach Anspruch 9 oder 10 in einem Kulturmedium zur Herstellung 
des Genprodukts in Kulturmedium oder im periplasmatischen Raum des Mikroorganisrnus, und 

{b) Isolieren des Genprodukts aus dem Kulturmedium oder dem periplasmatischen Raum des 
Mikroorganisrnus. 

12. Verfahren nach Anspruch 11, gekennzeichnet durch Herstellung von Pseudomonas- 
Carboxypeptidase G 2 als Genprodukt. 

13. Verfahren nach Anspruch 11, gekennzeichnet durch Herstellung eines anderen Proteins oder 
30 Polypeptids als Pseudomonas-Carboxypeptidase G 2 als Genprodukt. 

14. Verfahren nach Anspruch 13, gekennzeichnet durch eines anderen prokaryotischen Proteins als 
Pseudomonas-Carboxypeptidase G 2 als Genprodukt. 

15. Verfahren nach Anspruch 14, gekennzeichnet durch p-Galactosidase aus E. coli als Genprodukt. 

35 Revendications 

1. Un vecteur de transfert d'ADN recombinant comprenant une sequence guide de polynucleotide en 
aval et en phase de lecture relativement a un promoteur de bacterie ou de levure et a un site de liaison 
ribosomique procaryotique et en amont et en phase de lecture relativement a un gene de structure, 
40 caracterise en ce que la sequence guide de polynucleotide code pour un polypeptide signal de formule I 

Met-Arg-Pro-Ser-lle-His-Arg-Thr-Ala-lle-Ala 

Ala-Val-Leu-Ala-Thr-Ala-Phe-Val-Ala-Gly-Thr. I 

45 2. Un vecteur de transfert d'ADN recombinant selon Ja revendication 1, caracterise en ca que la 
sequence guide de polynucleotide repond a la formule II 

5'— ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 
so GCC TTC GTG GCG GGC ACC —3' 

3. Un vecteur de transfer d'ADN recombinant selon I'une ou I'autre des revendications 1 ou 2, 
caracterise en ce que le gene de structure code pour la carboxypeptidase G 2 de Pseudomonas (CPG 2 ). 

4. Un vecteur de transfer d'ADN recombinant selon I'une ou I'autre des revendications 1 ou 2, 
ss caracterise en ce que le gene de structure code pour une proteine ou un polypeptide autre que la 

carboxypeptidase G 2 de Pseudomonas. 

5. Un vecteur de transfert d'ADN recombinant selon la revendication 4 caracterise en ce que ie gene de 
structure code pour une proteine procaryotique autre que la carboxypeptidase G 2 de Pseudomonas. 

6. Un vecteur de transfert d'ADN recombinant selon la revendication 5 caracterise en ce que le gene de 
so structure code pour la p-galactosidase de E. coli. 

7. Un vecteur de transfert d'ADN recombinant selon la revendication 3 comprenant un polynucleotide 
de formule 



65 



20 
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1 





5'— 


Met 
ATG 


Arg 
CGC 


Pro 

CCA 


5 


Ala 
GCC 


He 
ATC 


Ala 
GCC 


Ala 
GCC 


10 


Phe 
TTC 


Val 
GTG 


20 

Ala 

GCG 


Gly 
GGC 


15 


Lys 
AAG 


Arg 
CGC 


Asp 
GAC 


30 

Asn 

AAC 


20 


Ala 
GCT 


Thr 
ACC 


Asp 
GAC 


Glu 
GAG 




Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAG 


25 


Glu 
GAG 


Thr 
ACC 


Gly 
GGC 


Thr 
ACC 


30 


lie 
ATC 


Ala 
GCC 


Ala 
GCT 


Ala 
GCG 


35 


Ala 
GCC 


Glu 
GAG 


Leu 
CTC 


Lys 
AAG 




Val 
GTC 


Thr 
ACG 


Arg 
CGA 


Ser 
AGC 


40 


90 
Val 
GTG 


Val 
GTG 


Gly 
GGC 


Asp 
GAC 


45 


lie 
ATC 


100 
Lys 
AAG 


Gly 
GGC 


Arg 
CGC 


50 


Leu 
CTG 


Leu 
CTG 


110 
Met 
ATG 


Ser 
TCG 


55 


Tyr 
TAC 


Leu 
CTC 


Lys 
AAG 


120 
Gly 
GGC 




Pro 
CCG 


Phe 
TTC 


Arg 
CGC 


Val 
GTC 


60 


Tyr 
TAC 


Gly 
GGC 


Pro 
CCG 


Gly 
GGC 
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Ser 
TCC 


lie 
ATC 


His 
CAC 


Arg 
CGC 


Thr 
ACA 


Val 
GTG 


Leu 
CTG 


Ala 
GCC 


Thr 
ACC 


Ala 
GCC 


Thr 
ACC 


Ala 
GCC 


Leu 
CTG 


Ala 
GCC 


Gin 
CAG 


Val 
GTG 


Leu 
CTG 


Phe 
TTC 


Gin 
CAG 


Ala 
GCA 


40 

Gin 

CAG 


Pro 

CCG 


Ala 
GCC 


Val 
GTG 


lie 
ATC 


Lys 
AAG 


b(J 

Leu 

CTG 


Val 
GTC 


Asn 
AAC 


He 
ATC 


Gly 
GGT 


Asp 
GAC 


60 

Ala < 
GCC 


Giu 
GAG 


Gly 
GGC 


Gly 
GGC 


Asn 
AAC 


Phe 
TTC 


70 

Leu 

CTC 


Glu 
GAG 


Asn 
AAC 


Leu 
CTC 


Gly 
GGC 


Phe 
TTC 


80 

Thr 

ACG 


Lys 
AAG 


Ser 
TCG 


Ala 
GCC 


Gly 
GGC 


Leu 
CTG 


Asn 
AAC 


He 
ATC 


Val 
GTG 


Gly 
GGC 


Lys 
AAG 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Asn 
AAC 


Leu 
CTG 


His 
CAC 


Met 
ATG 


Asp 
GAC 


Thr 
ACC 


Val 
GTC 


He 
ATT 


Leu 
CTC 


Ala 
GCG 


Lys 
AAG 


Ala 
GCC 


130 
Glu 
GAA 


Gly 
GGC 


Asp 
GAC 


Lys 
AAG 


Ala 
GCC 


He 
ATC 


140 
Ala 
GCC 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 
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Gly 
GGC 


Gly 
GGC 


Asn 
AAC 


Leu 
CTC 


Lys 
AAG 


Leu 
CTG 


Arg 
CGC 


Asp 
GAC 


Tyr 
TAC 


Phe 
TTC 


Asn 
AAC 


Thr 
ACC 


180 
Phe 
TTC 


Gly 
GGC 


Ser 
TCG 



VJIU 

GAA 


190 
Ala 
GCC 


i_y;> 

AAG 


Ser 
TCC 


Phe 
TTC 


200 
Glu 
GAG 


Glu 
GAA 


Lys 
AAA 


Leu 
CTC 


lie 

ATC 


Ala 
GCC 


Tyr 
TAC 


Gly 
GGC 


Lys 
AAG 


Ala 
GCC 


Pro 

CCC 


Glu 
GAG 


Leu 
CTG 


Glu 
GAG 


Ala 
GCT 


Ser 
TCC 


Met 
ATG 


Asn 
AAC 


He 
ATC 


Leu 
CTG 


Arg 
CGC 


Phe 
TTC 


270 
Ala 
GCC 


Gly 
GGC 


Asn 
AAC 


Ala 
GCC 


280 
Ser 
AGC 


Ala 
GCC 


Arg 
CGC 


Tyr 
TAC 


290 
Ala 
GCG 



Ala 


Val 


He 


GCG 


GTC 


ATC 


Leu 


Lys 


Glu 


CTG 


AAG 


GAA 


Gly 


Thr 


He 


GGC 


ACC 


ATC 


Asp 


Glu 


Glu 




GAG 


GAA 


Arg 


Asp 


Leu 


CGC 


GAC 


CTG 


Leu 


Ala 


Asp 


CTG 


GCC 


GAC 


Pro 


Thr 


Ser 


CCC 


ACC 


AGC 



210 



Ser 
TCG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


220 
Gin 
CAG 


Val 
GTC 


Ser 
TCG 


His 
CAT 


230 
Ala 
GCC 


Gly 
GGC 


Val 
GTG 


Asn 
AAC 


Asp 
GAC 


Leu 
CTC 


Val 
GTG 


Asp 
GAC 


Asp 
GAC 


Lys 
AAG 


Asn 
AAC 


Trp 
TGG 


Thr 
ACC 


Val 
GTC 


- Ser 
TCG 


Asn 
AAC 


Thr 
ACG 


Leu 
CTG 


Asn 
AAC 


Arg 
CGC 


Asn 
AAC 


Glu 
GAG 



150 



Leu 


His 


Thr 


CTG 


CAC 


ACG 




160 




Tyr 


Gly 


Val 


TAC 


GGC 


GTG 






170 


Thr 


Val 


Leu 


ACC 


GTG 


CTG 


Lys 


Gly 


Ser 


AAG 


GGT 


TCC 


He 


Gin 


Glu 


ATC 


CAG 


GAA 


Tyr 


Val 


Leu 


TAC 


GTG 


CTC 


Ala 


Gly 


Asp 


GCA 


GGC 


GAC 


Thr 


Ser 


Gly 


ACC 


TCG 


GGC 



Asn lie Thr 

AAC ATC ACC 



Gly Ala Ala 

GGC GCC GCG 



240 



Ala 
GCG 


Leu 
CTG 


Val 
GTC 


Leu 
CTG 


250 
Arg 
CGC 


Thr 
ACG 


Ala 
GCG 


Lys 
AAG 


260 
Asn 
AAC 


He 
ATC 


Ala 

GCC 


Lys 
AAG 


He 
ATC 


He 
ATC 


Pro 
CCC 


Ala 
GCC 


Asp 
GAC 


Val 
GTG 


Asp 
GAC 


Phe 
TTC 


Asp 
GAC 
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300 



w 



20 



35 



40 



45 



50 



Ala 
GCC 


Ala 
GCC 


Met 
ATG 


Lys 
AAG 


Thr 
ACG 


Leu 
CTG 


Glu 
GAA 


'Glu 
GAG 


Arg 
GGC 


Ala 
GCG 


Gin 
CAG 


Gin 
CAG 


Lys 
AAG 


310 
Lys 
AAG 


Leu 
CTG 


Pro 
CCC 


Glu 
GAG 


Ala 
GCC 


Asp 
GAC 


Val 
GTG 


Lys 
AAG 


Val 
GTG 


lie 
ATC 


320 
Val 
GTC 


Thr 
ACG 


Arg 
CGC 


Gly 
GGC 


Arg 
CGC 


Pro 
CCG 


Ala 
GCC 


Phe 
TTC 


• Asn 
AAT 


Ala 
GCC 


330 
Gly 
GGC 


Glu 
GAA 


Gly 
GGC 


Gly 
GGC 


Lys 
AAG 


Lys 
AAG 


Leu 
CTG 


Val 
GTC 


Asp 
GAC 


Lys 
AAG 


340 
Ala 
GCG 


Val 
GTG 


Ala 
GCC 


Tyr 
TAC 


Tyr 
TAC 


Lys 
AAG 


Glu 
GAA 


Ala 
GCC 


Gly 
GGC 


Gly 
GGC 


350 
Thr 
ACG 


Leu 
CTG 


Gly 
GGC 


Val 
GTG 


Glu 

GAA 


Glu 
GAG 


Arg 
CGC 


Thr 
ACC 


Gly 
GGC 


Gly 
GGC 


360 
Gly 
GGC 


Thr 
ACC 


Asp 
GAC 


Ala 
GCG 


Ala 
GCC 


Tyr 
TAC 


Ala 
GCC 


Ala 
GCG 


Leu 
CTC 


Ser 
TCA 


370 
Gly 
GGC 


Lys 
AAG 


Pro 
CCA 


Val 

GTG 


He 
ATC 


Glu 
GAG 


Ser 
AGC 


Leu 
CTG 


Gly 
GGC 


Leu 
CTG 


380 
Pro 
CCG 


Gly 
GGC 


Phe 
TTC 


Gly 
GGC 


Tyr 
TAC 


His 
CAC 


Ser 
AGC 


Asp 
GAC 


Lys 
AAG 


Aia 
GCC 


390 
Glu 
GAG 


Tyr 
TAC 


Val 
GTG 


Asp 
GAC 


lie 
ATC 


Ser 

AGC 


Ala 
GCG 


He 
ATT 


Pro 
CCG 


Arg 
CGC 


400 
Arg 
CGC 


Leu 
CTG 


Tyr 
TAC 


Met 
ATG 


Ala 
GCT 


Ala 
CGC 


Arg 
CGC 


Leu 
CTG 


He 
ATC 


Met 
ATG 


410 
Asp 
GAT 


Leu 
CTG 


Gly 
GGC 


Ala 
GCC 


Gly 
GGC 


Lys 
AAG 


TGA 


—3' 
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8. Un vecteur de transfert d'ADN recombinant selon Tune quelconque des revendtcations precedentes 
caracterise en ce que le vecteur de transfert est un plasmide. 

9. Un micro-organisme transforme par un vecteur de transfert caracterise en ce que le vecteur de 
transfert est un vecteur de transfert d'ADN recombinant selon Tune quelconque des revendications 
precedentes 1 a 8. 

10. Un micro-organisme selon la revendication 9 qui est une bacteria de I'espece E. coli ou 
Pseudomonas ou une levure de I'espece Saccharomyces cerevisiae. 

11. Un procede pour la preparation d'un produit genique caracterise par 

(a) la culture d'un micro-organisme selon soit la revendication 9 soit la revendication 10 dans un milieu 
de culture pour produire le produit genique dans le milieu de culture ou dans I'espace peripiasmique du 
micro-organisme et 
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(b) I'isolement du produit genique du milieu de culture ou de I'espace periplasmique du micro- 
organisms se)Qn |g revendicat [ on n caracterise en ce que !e produit genique est la 

carboxvpeptidase G 2 de Pseudomonas. m 
5 13. Un procede selon la revendication 11 caracterise en ce que le produit genique est une proteme ou 
un polypeptide autre que la carboxvpeptidase G 2 de Pseudomonas. 

14. Un procede selon la revendication 13 caracterise en ce que le produit genique est une proteme 
procaryotique autre que la carboxvpeptidase G 2 de Pseudomonas. 

1 5. Un procede selon la revendication 14 caracterise en ce que le produit genique est la p-galactosidase 
to de E. coli. 
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RESTRICTION ENZYME MAP OF pNM14 

Fig.3. 
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Soy3A FRAGMENT 
GATCC xGATC 

s 

(A) 



/ 

/ 

/ 



— i — i — i : M13mp7 
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Fig.6. 
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a. t> 




I 1 pBR322 

CHI] Yeast 2/0 plasmid 

■■ Yeast chromosomal leu 2 gene 

FTvTl Pseudomonas carboxypeptidase G2 gene 

Fig. 7 

RESTRICTION ENZYME MAP OF pLEC 3 
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